assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	1	122934-123024	1	CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	GTGCGCCCCCCGCGCCCGCGGCGGGCGCGG	30	1	2	122964-122994|122964-122994	CP012670.1_3870041-3870011|CP012670.1_3048143-3048113	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|187aa|up_5|CP012670.1_113195_113756_+,NA|288aa|up_3|CP012670.1_115798_116662_-,NA|48aa|down_0|CP012670.1_123088_123232_-,NA|242aa|down_1|CP012670.1_123379_124105_+,NA|244aa|down_3|CP012670.1_125023_125755_+,NA|516aa|down_4|CP012670.1_125820_127368_-,NA|338aa|down_8|CP012670.1_131261_132275_-	NA|607aa|up_9|CP012670.1_107909_109730_-	cd07185, OmpA_C-like, Peptidoglycan binding domains similar to the C-terminal domain of outer-membrane protein OmpA	NA|449aa|up_8|CP012670.1_109986_111333_-	PRK04531, PRK04531, acetylglutamate kinase; Provisional	NA|333aa|up_7|CP012670.1_111332_112331_-	PRK04523, PRK04523, N-acetylornithine carbamoyltransferase; Reviewed	NA|115aa|up_6|CP012670.1_112549_112894_-	cd02407, PTH2_family, Peptidyl-tRNA hydrolase, type 2 (PTH2)_like 	NA|187aa|up_5|CP012670.1_113195_113756_+	NA	NA|468aa|up_4|CP012670.1_114308_115712_-	pfam06224, HTH_42, Winged helix DNA-binding domain	NA|288aa|up_3|CP012670.1_115798_116662_-	NA	NA|1270aa|up_2|CP012670.1_116840_120650_-	COG1074, RecB, ATP-dependent exoDNAse (exonuclease V) beta subunit (contains helicase and exonuclease domains) [DNA replication, recombination, and repair]	NA|175aa|up_1|CP012670.1_120791_121316_-	COG3465, COG3465, Uncharacterized conserved protein [Function unknown]	NA|466aa|up_0|CP012670.1_121341_122739_-	COG1078, COG1078, HD superfamily phosphohydrolases [General function prediction only]	NA|48aa|down_0|CP012670.1_123088_123232_-	NA	NA|242aa|down_1|CP012670.1_123379_124105_+	NA	NA|303aa|down_2|CP012670.1_124101_125010_+	cd13962, PT_UbiA_UBIAD1, 1,4-Dihydroxy-2-naphthoate octaprenyltransferase	NA|244aa|down_3|CP012670.1_125023_125755_+	NA	NA|516aa|down_4|CP012670.1_125820_127368_-	NA	NA|434aa|down_5|CP012670.1_128400_129702_+	COG0025, NhaP, NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism]	NA|219aa|down_6|CP012670.1_129814_130471_+	cd00657, Ferritin_like, Ferritin-like superfamily of diiron-containing four-helix-bundle proteins	NA|198aa|down_7|CP012670.1_130528_131122_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|338aa|down_8|CP012670.1_131261_132275_-	NA	NA|1358aa|down_9|CP012670.1_132289_136363_-	pfam13665, DUF4150, Domain of unknown function (DUF4150)
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	3	399559-401139	1,3,1	PILER-CR,CRISPRCasFinder,CRT	no	csb2gr5,cas7,cas8u1,cas3,cas1,cas2	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Unclear	CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC	36,36,36	0	0	NA	NA	NA:NA:NA	21,21,21	21	Unclear	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|273aa|up_3|CP012670.1_395693_396512_-,NA|274aa|up_0|CP012670.1_398028_398850_-,cas8u1|313aa|down_7|CP012670.1_408636_409575_-	NA|205aa|up_9|CP012670.1_386683_387298_-	pfam05685, Uma2, Putative restriction endonuclease	NA|143aa|up_8|CP012670.1_387376_387805_-	pfam01124, MAPEG, MAPEG family	csb2gr5|562aa|up_7|CP012670.1_388021_389707_-	TIGR02165, CRISPR-associated_protein_GSU0054_family, CRISPR-associated protein GSU0054/csb2, Dpsyc system	cas7|325aa|up_6|CP012670.1_389706_390681_-	pfam09617, Cas_GSU0053, CRISPR-associated protein GSU0053 (Cas_GSU0053)	cas8u1|753aa|up_5|CP012670.1_390683_392942_-	TIGR04113, hypothetical_protein_AaLAA1DRAFT_1703, CRISPR-associated protein Csx17, subtype Dpsyc	cas3|882aa|up_4|CP012670.1_392938_395584_-	TIGR02621, CRISPR-associated_helicase_Cas3, CRISPR-associated helicase Cas3, subtype Dpsyc	NA|273aa|up_3|CP012670.1_395693_396512_-	NA	cas1|313aa|up_2|CP012670.1_396598_397537_+	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas2|97aa|up_1|CP012670.1_397662_397953_+	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	NA|274aa|up_0|CP012670.1_398028_398850_-	NA	NA|420aa|down_0|CP012670.1_401353_402613_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|186aa|down_1|CP012670.1_403436_403994_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|118aa|down_2|CP012670.1_404038_404392_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|206aa|down_3|CP012670.1_405037_405655_-	pfam13521, AAA_28, AAA domain	NA|331aa|down_4|CP012670.1_405662_406655_-	pfam13578, Methyltransf_24, Methyltransferase domain	NA|364aa|down_5|CP012670.1_406651_407743_-	TIGR02666, Cyclic_pyranopterin_monophosphate_synthase, molybdenum cofactor biosynthesis protein A, bacterial	NA|249aa|down_6|CP012670.1_407739_408486_-	COG0535, COG0535, Predicted Fe-S oxidoreductases [General function prediction only]	cas8u1|313aa|down_7|CP012670.1_408636_409575_-	NA	cas3|1024aa|down_8|CP012670.1_409571_412643_-	cd09696, Cas3_I, CRISPR/Cas system-associated protein Cas3; Distinct Cas3 family with HD domain fused to C-termus of Helicase domain	csb2gr5|564aa|down_9|CP012670.1_412635_414327_-	TIGR02165, CRISPR-associated_protein_GSU0054_family, CRISPR-associated protein GSU0054/csb2, Dpsyc system
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	4	402638-403409	2,4,2	PILER-CR,CRISPRCasFinder,CRT	no	csb2gr5,cas7,cas8u1,cas3,cas1,cas2	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Unclear	CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC	36,36,36	0	0	NA	NA	NA:NA:NA	10,10,10	10	Unclear	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|273aa|up_4|CP012670.1_395693_396512_-,NA|274aa|up_1|CP012670.1_398028_398850_-,cas8u1|313aa|down_6|CP012670.1_408636_409575_-	NA|143aa|up_9|CP012670.1_387376_387805_-	pfam01124, MAPEG, MAPEG family	csb2gr5|562aa|up_8|CP012670.1_388021_389707_-	TIGR02165, CRISPR-associated_protein_GSU0054_family, CRISPR-associated protein GSU0054/csb2, Dpsyc system	cas7|325aa|up_7|CP012670.1_389706_390681_-	pfam09617, Cas_GSU0053, CRISPR-associated protein GSU0053 (Cas_GSU0053)	cas8u1|753aa|up_6|CP012670.1_390683_392942_-	TIGR04113, hypothetical_protein_AaLAA1DRAFT_1703, CRISPR-associated protein Csx17, subtype Dpsyc	cas3|882aa|up_5|CP012670.1_392938_395584_-	TIGR02621, CRISPR-associated_helicase_Cas3, CRISPR-associated helicase Cas3, subtype Dpsyc	NA|273aa|up_4|CP012670.1_395693_396512_-	NA	cas1|313aa|up_3|CP012670.1_396598_397537_+	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas2|97aa|up_2|CP012670.1_397662_397953_+	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	NA|274aa|up_1|CP012670.1_398028_398850_-	NA	NA|420aa|up_0|CP012670.1_401353_402613_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|186aa|down_0|CP012670.1_403436_403994_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|118aa|down_1|CP012670.1_404038_404392_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|206aa|down_2|CP012670.1_405037_405655_-	pfam13521, AAA_28, AAA domain	NA|331aa|down_3|CP012670.1_405662_406655_-	pfam13578, Methyltransf_24, Methyltransferase domain	NA|364aa|down_4|CP012670.1_406651_407743_-	TIGR02666, Cyclic_pyranopterin_monophosphate_synthase, molybdenum cofactor biosynthesis protein A, bacterial	NA|249aa|down_5|CP012670.1_407739_408486_-	COG0535, COG0535, Predicted Fe-S oxidoreductases [General function prediction only]	cas8u1|313aa|down_6|CP012670.1_408636_409575_-	NA	cas3|1024aa|down_7|CP012670.1_409571_412643_-	cd09696, Cas3_I, CRISPR/Cas system-associated protein Cas3; Distinct Cas3 family with HD domain fused to C-termus of Helicase domain	csb2gr5|564aa|down_8|CP012670.1_412635_414327_-	TIGR02165, CRISPR-associated_protein_GSU0054_family, CRISPR-associated protein GSU0054/csb2, Dpsyc system	cas7|426aa|down_9|CP012670.1_414326_415604_-	pfam09617, Cas_GSU0053, CRISPR-associated protein GSU0053 (Cas_GSU0053)
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	5	404634-404960	3,5,3	PILER-CR,CRISPRCasFinder,CRT	no	csb2gr5,cas7,cas8u1,cas3,cas1,cas2	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Unclear	CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC	36,36,36	0	0	NA	NA	NA:NA:NA	4,4,4	4	Unclear	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|273aa|up_6|CP012670.1_395693_396512_-,NA|274aa|up_3|CP012670.1_398028_398850_-,cas8u1|313aa|down_4|CP012670.1_408636_409575_-	cas7|325aa|up_9|CP012670.1_389706_390681_-	pfam09617, Cas_GSU0053, CRISPR-associated protein GSU0053 (Cas_GSU0053)	cas8u1|753aa|up_8|CP012670.1_390683_392942_-	TIGR04113, hypothetical_protein_AaLAA1DRAFT_1703, CRISPR-associated protein Csx17, subtype Dpsyc	cas3|882aa|up_7|CP012670.1_392938_395584_-	TIGR02621, CRISPR-associated_helicase_Cas3, CRISPR-associated helicase Cas3, subtype Dpsyc	NA|273aa|up_6|CP012670.1_395693_396512_-	NA	cas1|313aa|up_5|CP012670.1_396598_397537_+	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas2|97aa|up_4|CP012670.1_397662_397953_+	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	NA|274aa|up_3|CP012670.1_398028_398850_-	NA	NA|420aa|up_2|CP012670.1_401353_402613_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|186aa|up_1|CP012670.1_403436_403994_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|118aa|up_0|CP012670.1_404038_404392_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|206aa|down_0|CP012670.1_405037_405655_-	pfam13521, AAA_28, AAA domain	NA|331aa|down_1|CP012670.1_405662_406655_-	pfam13578, Methyltransf_24, Methyltransferase domain	NA|364aa|down_2|CP012670.1_406651_407743_-	TIGR02666, Cyclic_pyranopterin_monophosphate_synthase, molybdenum cofactor biosynthesis protein A, bacterial	NA|249aa|down_3|CP012670.1_407739_408486_-	COG0535, COG0535, Predicted Fe-S oxidoreductases [General function prediction only]	cas8u1|313aa|down_4|CP012670.1_408636_409575_-	NA	cas3|1024aa|down_5|CP012670.1_409571_412643_-	cd09696, Cas3_I, CRISPR/Cas system-associated protein Cas3; Distinct Cas3 family with HD domain fused to C-termus of Helicase domain	csb2gr5|564aa|down_6|CP012670.1_412635_414327_-	TIGR02165, CRISPR-associated_protein_GSU0054_family, CRISPR-associated protein GSU0054/csb2, Dpsyc system	cas7|426aa|down_7|CP012670.1_414326_415604_-	pfam09617, Cas_GSU0053, CRISPR-associated protein GSU0053 (Cas_GSU0053)	cas1|663aa|down_8|CP012670.1_416301_418290_+	cd09634, Cas1_I-II-III, CRISPR/Cas system-associated protein Cas1	cas2|97aa|down_9|CP012670.1_418449_418740_+	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	6	420381-423904	4,6,4	PILER-CR,CRISPRCasFinder,CRT	no	cas8u1,cas3,csb2gr5,cas7,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Type III-A,Type III-D,Type III-C,Type III-B	CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC	36,36,36	0	0	NA	NA	NA:NA:NA	48,48,48	48	TypeIII-A,TypeIII-D,TypeIII-C,TypeIII-B	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	cas8u1|313aa|up_6|CP012670.1_408636_409575_-,NA|274aa|up_0|CP012670.1_418850_419672_-,NA	NA|331aa|up_9|CP012670.1_405662_406655_-	pfam13578, Methyltransf_24, Methyltransferase domain	NA|364aa|up_8|CP012670.1_406651_407743_-	TIGR02666, Cyclic_pyranopterin_monophosphate_synthase, molybdenum cofactor biosynthesis protein A, bacterial	NA|249aa|up_7|CP012670.1_407739_408486_-	COG0535, COG0535, Predicted Fe-S oxidoreductases [General function prediction only]	cas8u1|313aa|up_6|CP012670.1_408636_409575_-	NA	cas3|1024aa|up_5|CP012670.1_409571_412643_-	cd09696, Cas3_I, CRISPR/Cas system-associated protein Cas3; Distinct Cas3 family with HD domain fused to C-termus of Helicase domain	csb2gr5|564aa|up_4|CP012670.1_412635_414327_-	TIGR02165, CRISPR-associated_protein_GSU0054_family, CRISPR-associated protein GSU0054/csb2, Dpsyc system	cas7|426aa|up_3|CP012670.1_414326_415604_-	pfam09617, Cas_GSU0053, CRISPR-associated protein GSU0053 (Cas_GSU0053)	cas1|663aa|up_2|CP012670.1_416301_418290_+	cd09634, Cas1_I-II-III, CRISPR/Cas system-associated protein Cas1	cas2|97aa|up_1|CP012670.1_418449_418740_+	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	NA|274aa|up_0|CP012670.1_418850_419672_-	NA	NA|208aa|down_0|CP012670.1_424468_425092_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|947aa|down_1|CP012670.1_425261_428102_+	PHA03307, PHA03307, transcriptional regulator ICP4; Provisional	NA|1142aa|down_2|CP012670.1_427837_431263_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|267aa|down_3|CP012670.1_431552_432353_+	TIGR02984, Sig-70_plancto1, RNA polymerase sigma-70 factor, Planctomycetaceae-specific subfamily 1	NA|285aa|down_4|CP012670.1_432375_433230_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|338aa|down_5|CP012670.1_433361_434375_+	cd05400, NT_2-5OAS_ClassI-CCAase, Nucleotidyltransferase (NT) domain of 2'5'-oligoadenylate (2-5A)synthetase (2-5OAS) and class I CCA-adding enzyme	NA|583aa|down_6|CP012670.1_434400_436149_+	pfam18145, SAVED, SMODS-associated and fused to various effectors sensor domain	cas10|617aa|down_7|CP012670.1_436203_438054_+	cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10	cmr3gr5|392aa|down_8|CP012670.1_438050_439226_+	pfam09700, Cas_Cmr3, CRISPR-associated protein (Cas_Cmr3)	cmr1gr7|416aa|down_9|CP012670.1_439222_440470_+	COG1367, COG1367, CRISPR system related protein, RAMP superfamily [Defense mechanisms]
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	7	499103-499204	7	CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	GCGGTGACCGCGCCCGCGCGGGAG	24	0	0	NA	NA	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|426aa|up_8|CP012670.1_477823_479101_-,NA|133aa|up_5|CP012670.1_483339_483738_-,NA|276aa|down_3|CP012670.1_504520_505348_-,NA|194aa|down_7|CP012670.1_510751_511333_+	NA|155aa|up_9|CP012670.1_477335_477800_-	cd00322, FNR_like, Ferredoxin reductase (FNR), an FAD and NAD(P) binding protein, was intially identified as a chloroplast reductase activity, catalyzing the electron transfer from reduced iron-sulfur protein ferredoxin to NADP+ as the final step in the electron transport mechanism of photosystem I	NA|426aa|up_8|CP012670.1_477823_479101_-	NA	NA|689aa|up_7|CP012670.1_479097_481164_-	TIGR03696, tRNA_nuclease_WapA, RHS repeat-associated core domain	NA|615aa|up_6|CP012670.1_481167_483012_-	COG3209, RhsA, Rhs family protein [Cell envelope biogenesis, outer membrane]	NA|133aa|up_5|CP012670.1_483339_483738_-	NA	NA|197aa|up_4|CP012670.1_483849_484440_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|1575aa|up_3|CP012670.1_484540_489265_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|315aa|up_2|CP012670.1_489277_490222_-	PRK01973, PRK01973, septum site-determining protein MinC	NA|388aa|up_1|CP012670.1_490522_491686_+	pfam00561, Abhydrolase_1, alpha/beta hydrolase fold	NA|1390aa|up_0|CP012670.1_492518_496688_-	COG3209, RhsA, Rhs family protein [Cell envelope biogenesis, outer membrane]	NA|407aa|down_0|CP012670.1_499229_500450_+	PRK01346, PRK01346, enhanced intracellular survival protein Eis	NA|182aa|down_1|CP012670.1_500453_500999_+	smart00271, DnaJ, DnaJ molecular chaperone homology domain	NA|1107aa|down_2|CP012670.1_500995_504316_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|276aa|down_3|CP012670.1_504520_505348_-	NA	NA|238aa|down_4|CP012670.1_505782_506496_-	cd02969, PRX_like1, Peroxiredoxin (PRX)-like 1 family; hypothetical proteins that show sequence similarity to PRXs	NA|119aa|down_5|CP012670.1_506707_507064_-	pfam07883, Cupin_2, Cupin domain	NA|580aa|down_6|CP012670.1_507177_508917_-	pfam13598, DUF4139, Domain of unknown function (DUF4139)	NA|194aa|down_7|CP012670.1_510751_511333_+	NA	NA|565aa|down_8|CP012670.1_511281_512976_-	NF033452, BREX_1_MTaseX, BREX-1 system adenine-specific DNA-methyltransferase PglX	NA|603aa|down_9|CP012670.1_513144_514953_+	PRK10565, PRK10565, putative carbohydrate kinase; Provisional
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	8	590432-590673	5	CRT	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	GGTTGGCCGACCGAGGTTGACCGAC	25	0	0	NA	NA	NA	4	4	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|361aa|up_4|CP012670.1_580143_581226_-,NA|332aa|down_7|CP012670.1_600508_601504_-	NA|176aa|up_9|CP012670.1_575491_576019_-	pfam04115, Ureidogly_lyase, Ureidoglycolate lyase	NA|209aa|up_8|CP012670.1_576407_577034_+	pfam04011, LemA, LemA family	NA|267aa|up_7|CP012670.1_577033_577834_+	COG1512, COG1512, Beta-propeller domains of methanol dehydrogenase type [General function prediction only]	NA|404aa|up_6|CP012670.1_578064_579276_+	cd00751, thiolase, Thiolase are ubiquitous enzymes that catalyze the reversible thiolytic cleavage of 3-ketoacyl-CoA into acyl-CoA and acetyl-CoA, a 2-step reaction involving a covalent intermediate formed with a catalytic cysteine	NA|190aa|up_5|CP012670.1_579410_579980_+	COG0652, PpiB, Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family [Posttranslational modification, protein turnover, chaperones]	NA|361aa|up_4|CP012670.1_580143_581226_-	NA	NA|328aa|up_3|CP012670.1_581382_582366_-	smart00880, CHAD, The CHAD domain is an alpha-helical domain functionally associated with some members of the adenylate cyclase family	NA|564aa|up_2|CP012670.1_582362_584054_-	COG0595, COG0595, mRNA degradation ribonucleases J1/J2 (metallo-beta-lactamase superfamily) [Translation, ribosomal structure and biogenesis; Replication, recombination and repair]	NA|597aa|up_1|CP012670.1_584379_586170_+	cd07023, S49_Sppa_N_C, Signal peptide peptidase A (SppA), a serine protease, has catalytic Ser-Lys dyad	NA|1140aa|up_0|CP012670.1_586727_590147_+	PRK07562, PRK07562, vitamin B12-dependent ribonucleotide reductase	NA|348aa|down_0|CP012670.1_591056_592100_-	cd02194, ThiL, ThiL (Thiamine-monophosphate kinase) plays a dual role in de novo biosynthesis and in salvage of exogenous thiamine	NA|534aa|down_1|CP012670.1_592096_593698_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|231aa|down_2|CP012670.1_593694_594387_-	cd00882, Ras_like_GTPase, Rat sarcoma (Ras)-like superfamily of small guanosine triphosphatases (GTPases)	NA|606aa|down_3|CP012670.1_594531_596349_-	cd09912, DLP_2, Dynamin-like protein including dynamins, mitofusins, and guanylate-binding proteins	NA|298aa|down_4|CP012670.1_596393_597287_-	cd07325, M48_Ste24p_like, M48 Ste24 endopeptidase-like, integral membrane metallopeptidase	NA|343aa|down_5|CP012670.1_597376_598405_-	cd05153, HomoserineK_II, Type II Homoserine Kinase	NA|627aa|down_6|CP012670.1_598582_600463_+	TIGR03156, GTP_HflX, GTP-binding protein HflX	NA|332aa|down_7|CP012670.1_600508_601504_-	NA	NA|400aa|down_8|CP012670.1_601514_602714_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|203aa|down_9|CP012670.1_602869_603478_-	COG1648, CysG, Siroheme synthase (precorrin-2 oxidase/ferrochelatase domain) [Coenzyme metabolism]
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	9	672515-672612	8	CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	GGCCGCGCTCGAGCAGGGGCTCGC	24	0	0	NA	NA	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|279aa|up_8|CP012670.1_662746_663583_-,NA|177aa|up_7|CP012670.1_663660_664191_+,NA|382aa|up_6|CP012670.1_664418_665564_+,NA	NA|287aa|up_9|CP012670.1_661113_661974_-	pfam00494, SQS_PSY, Squalene/phytoene synthase	NA|279aa|up_8|CP012670.1_662746_663583_-	NA	NA|177aa|up_7|CP012670.1_663660_664191_+	NA	NA|382aa|up_6|CP012670.1_664418_665564_+	NA	NA|156aa|up_5|CP012670.1_665620_666088_+	COG2128, COG2128, Uncharacterized conserved protein [Function unknown]	NA|503aa|up_4|CP012670.1_666125_667634_-	cd05945, DltA, D-alanine:D-alanyl carrier protein ligase (DltA) and similar proteins	NA|88aa|up_3|CP012670.1_667644_667908_-	pfam00550, PP-binding, Phosphopantetheine attachment site	NA|462aa|up_2|CP012670.1_667968_669354_-	TIGR01137, Cystathionine_beta-synthase, cystathionine beta-synthase	NA|145aa|up_1|CP012670.1_669809_670244_+	cd06587, VOC, vicinal oxygen chelate (VOC) family	NA|86aa|up_0|CP012670.1_670815_671073_+	COG2261, COG2261, Predicted membrane protein [Function unknown]	NA|632aa|down_0|CP012670.1_672654_674550_-	cd00400, Voltage_gated_ClC, CLC voltage-gated chloride channel	NA|158aa|down_1|CP012670.1_674551_675025_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|287aa|down_2|CP012670.1_675268_676129_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|316aa|down_3|CP012670.1_676256_677204_+	PRK09348, glyQ, glycyl-tRNA synthetase subunit alpha; Validated	NA|193aa|down_4|CP012670.1_677242_677821_-	PRK00455, pyrE, orotate phosphoribosyltransferase; Validated	NA|239aa|down_5|CP012670.1_677817_678534_-	pfam04519, Bactofilin, Polymer-forming cytoskeletal	NA|105aa|down_6|CP012670.1_678537_678852_-	COG1664, CcmA, Integral membrane protein CcmA involved in cell shape determination [Cell envelope biogenesis, outer membrane]	NA|142aa|down_7|CP012670.1_678906_679332_-	pfam04519, Bactofilin, Polymer-forming cytoskeletal	NA|464aa|down_8|CP012670.1_680242_681634_+	pfam17885, Smoa_sbd, Styrene monooxygenase A putative substrate binding domain	NA|1102aa|down_9|CP012670.1_681677_684983_-	cd01465, vWA_subgroup, VWA subgroup: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	10	831603-831683	9	CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	CCCCCTCCGTGGGCCGGGCAGCTCGGC	27	0	0	NA	NA	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|125aa|up_4|CP012670.1_824575_824950_+,NA|230aa|up_3|CP012670.1_825282_825972_+,NA|178aa|up_1|CP012670.1_827476_828010_-,NA|218aa|down_3|CP012670.1_836377_837031_-,NA|82aa|down_5|CP012670.1_838782_839028_+	NA|326aa|up_9|CP012670.1_818445_819423_-	COG0179, MhpD, 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) [Secondary metabolites biosynthesis, transport, and catabolism]	NA|445aa|up_8|CP012670.1_819829_821164_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|149aa|up_7|CP012670.1_821237_821684_-	PRK00872, PRK00872, hypothetical protein; Provisional	NA|409aa|up_6|CP012670.1_821680_822907_-	PRK07538, PRK07538, hypothetical protein; Provisional	NA|367aa|up_5|CP012670.1_823042_824143_-	COG3509, LpqC, Poly(3-hydroxybutyrate) depolymerase [Secondary metabolites biosynthesis, transport, and catabolism]	NA|125aa|up_4|CP012670.1_824575_824950_+	NA	NA|230aa|up_3|CP012670.1_825282_825972_+	NA	NA|333aa|up_2|CP012670.1_825999_826998_-	COG2605, COG2605, Predicted kinase related to galactokinase and mevalonate kinase [General function prediction only]	NA|178aa|up_1|CP012670.1_827476_828010_-	NA	NA|776aa|up_0|CP012670.1_828293_830621_+	COG3973, COG3973, Superfamily I DNA and RNA helicases [General function prediction only]	NA|644aa|down_0|CP012670.1_831990_833922_+	cd12131, HGbI-like, Hell's gate globin I (HGbI) from Methylacidophilum infernorum and related proteins	NA|620aa|down_1|CP012670.1_834025_835885_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|133aa|down_2|CP012670.1_835964_836363_-	cd03209, GST_C_Mu, C-terminal, alpha helical domain of Class Mu Glutathione S-transferases	NA|218aa|down_3|CP012670.1_836377_837031_-	NA	NA|428aa|down_4|CP012670.1_837337_838621_-	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|82aa|down_5|CP012670.1_838782_839028_+	NA	NA|313aa|down_6|CP012670.1_839136_840075_-	PRK05710, PRK05710, tRNA glutamyl-Q(34) synthetase GluQRS	NA|623aa|down_7|CP012670.1_840172_842041_-	PRK01406, gltX, glutamyl-tRNA synthetase; Reviewed	NA|264aa|down_8|CP012670.1_842629_843421_+	PHA03169, PHA03169, hypothetical protein; Provisional	NA|127aa|down_9|CP012670.1_843461_843842_-	cd06464, ACD_sHsps-like, Alpha-crystallin domain (ACD) of alpha-crystallin-type small(s) heat shock proteins (Hsps)
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	11	1096375-1096468	10	CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	TCCGTCGCCGCAGCGCGCAGCGCGT	25	0	0	NA	NA	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA,NA|521aa|down_3|CP012670.1_1099834_1101397_+	NA|958aa|up_9|CP012670.1_1067671_1070545_-	PRK11091, PRK11091, aerobic respiration control sensor protein ArcB; Provisional	NA|420aa|up_8|CP012670.1_1070899_1072159_+	PRK07538, PRK07538, hypothetical protein; Provisional	NA|299aa|up_7|CP012670.1_1072332_1073229_+	cd08422, PBP2_CrgA_like, The C-terminal substrate binding domain of LysR-type transcriptional regulator CrgA and its related homologs, contains the type 2 periplasmic binding domain	NA|292aa|up_6|CP012670.1_1073317_1074193_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|346aa|up_5|CP012670.1_1074462_1075500_+	cd01833, XynB_like, SGNH_hydrolase subfamily, similar to Ruminococcus flavefaciens XynB	NA|550aa|up_4|CP012670.1_1076079_1077729_+	cd04742, NPD_FabD, 2-Nitropropane dioxygenase (NPD)-like domain, associated with the (acyl-carrier-protein) S-malonyltransferase  FabD	NA|2505aa|up_3|CP012670.1_1077757_1085272_+	TIGR02813, omega-3_polyunsaturated_fatty_acid_synthase_PfaA, polyketide-type polyunsaturated fatty acid synthase PfaA	NA|217aa|up_2|CP012670.1_1093224_1093875_-	pfam14103, DUF4276, Domain of unknown function (DUF4276)	NA|480aa|up_1|CP012670.1_1093871_1095311_-	pfam13304, AAA_21, AAA domain, putative AbiEii toxin, Type IV TA system	NA|178aa|up_0|CP012670.1_1095807_1096341_+	PRK11854, aceF, pyruvate dehydrogenase dihydrolipoyltransacetylase; Validated	NA|74aa|down_0|CP012670.1_1096549_1096771_-	pfam04325, DUF465, Protein of unknown function (DUF465)	NA|380aa|down_1|CP012670.1_1097136_1098276_+	TIGR02795, Uncharacterized_protein_in_oprL_3'region, tol-pal system protein YbgF	NA|385aa|down_2|CP012670.1_1098301_1099456_-	cd00198, vWFA, Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)	NA|521aa|down_3|CP012670.1_1099834_1101397_+	NA	NA|234aa|down_4|CP012670.1_1101431_1102133_-	COG0125, Tmk, Thymidylate kinase [Nucleotide transport and metabolism]	NA|298aa|down_5|CP012670.1_1102129_1103023_-	pfam02569, Pantoate_ligase, Pantoate-beta-alanine ligase	NA|798aa|down_6|CP012670.1_1103066_1105460_+	PRK05443, PRK05443, polyphosphate kinase; Provisional	NA|276aa|down_7|CP012670.1_1105594_1106422_-	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|294aa|down_8|CP012670.1_1106453_1107335_-	PRK07994, PRK07994, DNA polymerase III subunits gamma and tau; Validated	NA|153aa|down_9|CP012670.1_1107568_1108027_+	PRK05163, rpsL, 30S ribosomal protein S12; Validated
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	12	1567881-1568010	11	CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	CCCCCACGCCGCCCCACGACACGAACACCCTCTCCGGGGGCCTACGG	47	0	0	NA	NA	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|80aa|up_2|CP012670.1_1564766_1565006_+,NA|195aa|up_1|CP012670.1_1565055_1565640_+,NA|191aa|down_0|CP012670.1_1568077_1568650_-,NA|482aa|down_6|CP012670.1_1578840_1580286_+	NA|552aa|up_9|CP012670.1_1552879_1554535_-	COG0578, GlpA, Glycerol-3-phosphate dehydrogenase [Energy production and conversion]	NA|679aa|up_8|CP012670.1_1554748_1556785_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|167aa|up_7|CP012670.1_1557324_1557825_+	cd00156, REC, phosphoacceptor receiver (REC) domain of response regulators (RRs) and pseudo response regulators (PRRs)	NA|477aa|up_6|CP012670.1_1557881_1559312_+	COG2132, SufI, Putative multicopper oxidases [Secondary metabolites biosynthesis, transport, and catabolism]	NA|290aa|up_5|CP012670.1_1559609_1560479_+	pfam00089, Trypsin, Trypsin	NA|705aa|up_4|CP012670.1_1560491_1562606_-	cd08662, M13, Peptidase family M13 includes neprilysin and endothelin-converting enzyme I	NA|491aa|up_3|CP012670.1_1562733_1564206_-	TIGR03355, VI_chp_2, type VI secretion protein, EvpB/VC_A0108 family	NA|80aa|up_2|CP012670.1_1564766_1565006_+	NA	NA|195aa|up_1|CP012670.1_1565055_1565640_+	NA	NA|718aa|up_0|CP012670.1_1565636_1567790_+	cd07550, P-type_ATPase_HM, P-type heavy metal-transporting ATPase; uncharacterized subfamily	NA|191aa|down_0|CP012670.1_1568077_1568650_-	NA	NA|404aa|down_1|CP012670.1_1569456_1570668_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|369aa|down_2|CP012670.1_1570840_1571947_+	COG0287, TyrA, Prephenate dehydrogenase [Amino acid transport and metabolism]	NA|711aa|down_3|CP012670.1_1572677_1574810_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|134aa|down_4|CP012670.1_1575288_1575690_+	PTZ00473, PTZ00473, Plasmodium Vir superfamily; Provisional	NA|824aa|down_5|CP012670.1_1575850_1578322_-	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|482aa|down_6|CP012670.1_1578840_1580286_+	NA	NA|157aa|down_7|CP012670.1_1580427_1580898_-	COG3265, GntK, Gluconate kinase [Carbohydrate transport and metabolism]	NA|439aa|down_8|CP012670.1_1581124_1582441_+	cd19590, serpin_thermopin-like, serpin thermopin and similar proteins	NA|221aa|down_9|CP012670.1_1582454_1583117_-	COG2020, STE14, Putative protein-S-isoprenylcysteine methyltransferase [Posttranslational modification, protein turnover, chaperones]
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	13	1630911-1631021	12	CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	TGGAACTCGGTCGAGGGCGCCGTCGGAGGAGCCCATCGG	39	1	1	1630950-1630982	CP012670.1_2418037-2418005	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|346aa|up_7|CP012670.1_1621069_1622107_-,NA|323aa|up_5|CP012670.1_1624293_1625262_-,NA|81aa|up_1|CP012670.1_1630008_1630251_+,NA|152aa|down_1|CP012670.1_1632102_1632558_+,NA|67aa|down_7|CP012670.1_1638542_1638743_+,NA|100aa|down_8|CP012670.1_1638755_1639055_+	NA|407aa|up_9|CP012670.1_1618996_1620217_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|134aa|up_8|CP012670.1_1620575_1620977_+	cd07262, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|346aa|up_7|CP012670.1_1621069_1622107_-	NA	NA|339aa|up_6|CP012670.1_1622715_1623732_-	PRK05687, fliH, flagellar assembly protein FliH	NA|323aa|up_5|CP012670.1_1624293_1625262_-	NA	NA|959aa|up_4|CP012670.1_1625563_1628440_-	COG5184, ATS1, Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton]	NA|137aa|up_3|CP012670.1_1628963_1629374_+	cd06587, VOC, vicinal oxygen chelate (VOC) family	NA|153aa|up_2|CP012670.1_1629404_1629863_-	PRK10996, PRK10996, thioredoxin 2; Provisional	NA|81aa|up_1|CP012670.1_1630008_1630251_+	NA	NA|157aa|up_0|CP012670.1_1630333_1630804_-	cd07820, SRPBCC_3, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|134aa|down_0|CP012670.1_1631199_1631601_+	TIGR00927, retinal_rod, K+-dependent Na+/Ca+ exchanger	NA|152aa|down_1|CP012670.1_1632102_1632558_+	NA	NA|379aa|down_2|CP012670.1_1632557_1633694_+	pfam00665, rve, Integrase core domain	NA|387aa|down_3|CP012670.1_1633736_1634897_-	pfam00665, rve, Integrase core domain	NA|367aa|down_4|CP012670.1_1636747_1637848_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|124aa|down_5|CP012670.1_1637927_1638299_-	pfam13586, DDE_Tnp_1_2, Transposase DDE domain	NA|145aa|down_6|CP012670.1_1638295_1638730_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|67aa|down_7|CP012670.1_1638542_1638743_+	NA	NA|100aa|down_8|CP012670.1_1638755_1639055_+	NA	NA|120aa|down_9|CP012670.1_1640002_1640362_+	COG2963, COG2963, Transposase and inactivated derivatives [DNA replication, recombination, and repair]
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	14	1635328-1636533	5,13,6	PILER-CR,CRISPRCasFinder,CRT	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC	36,36,36	0	0	NA	NA	NA:NA:NA	16,16,16	16	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|323aa|up_9|CP012670.1_1624293_1625262_-,NA|81aa|up_5|CP012670.1_1630008_1630251_+,NA|152aa|up_2|CP012670.1_1632102_1632558_+,NA|67aa|down_3|CP012670.1_1638542_1638743_+,NA|100aa|down_4|CP012670.1_1638755_1639055_+,NA|95aa|down_7|CP012670.1_1641844_1642129_-	NA|323aa|up_9|CP012670.1_1624293_1625262_-	NA	NA|959aa|up_8|CP012670.1_1625563_1628440_-	COG5184, ATS1, Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton]	NA|137aa|up_7|CP012670.1_1628963_1629374_+	cd06587, VOC, vicinal oxygen chelate (VOC) family	NA|153aa|up_6|CP012670.1_1629404_1629863_-	PRK10996, PRK10996, thioredoxin 2; Provisional	NA|81aa|up_5|CP012670.1_1630008_1630251_+	NA	NA|157aa|up_4|CP012670.1_1630333_1630804_-	cd07820, SRPBCC_3, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|134aa|up_3|CP012670.1_1631199_1631601_+	TIGR00927, retinal_rod, K+-dependent Na+/Ca+ exchanger	NA|152aa|up_2|CP012670.1_1632102_1632558_+	NA	NA|379aa|up_1|CP012670.1_1632557_1633694_+	pfam00665, rve, Integrase core domain	NA|387aa|up_0|CP012670.1_1633736_1634897_-	pfam00665, rve, Integrase core domain	NA|367aa|down_0|CP012670.1_1636747_1637848_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|124aa|down_1|CP012670.1_1637927_1638299_-	pfam13586, DDE_Tnp_1_2, Transposase DDE domain	NA|145aa|down_2|CP012670.1_1638295_1638730_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|67aa|down_3|CP012670.1_1638542_1638743_+	NA	NA|100aa|down_4|CP012670.1_1638755_1639055_+	NA	NA|120aa|down_5|CP012670.1_1640002_1640362_+	COG2963, COG2963, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|378aa|down_6|CP012670.1_1640361_1641495_+	pfam00665, rve, Integrase core domain	NA|95aa|down_7|CP012670.1_1641844_1642129_-	NA	NA|575aa|down_8|CP012670.1_1642227_1643952_-	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|535aa|down_9|CP012670.1_1643955_1645560_-	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	15	1639075-1639845	6,14,7,7	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC	36,36,36,36	0	0	NA	NA	NA:NA:NA:NA	8,10,10,8	10	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|152aa|up_7|CP012670.1_1632102_1632558_+,NA|67aa|up_1|CP012670.1_1638542_1638743_+,NA|100aa|up_0|CP012670.1_1638755_1639055_+,NA|95aa|down_2|CP012670.1_1641844_1642129_-	NA|157aa|up_9|CP012670.1_1630333_1630804_-	cd07820, SRPBCC_3, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|134aa|up_8|CP012670.1_1631199_1631601_+	TIGR00927, retinal_rod, K+-dependent Na+/Ca+ exchanger	NA|152aa|up_7|CP012670.1_1632102_1632558_+	NA	NA|379aa|up_6|CP012670.1_1632557_1633694_+	pfam00665, rve, Integrase core domain	NA|387aa|up_5|CP012670.1_1633736_1634897_-	pfam00665, rve, Integrase core domain	NA|367aa|up_4|CP012670.1_1636747_1637848_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|124aa|up_3|CP012670.1_1637927_1638299_-	pfam13586, DDE_Tnp_1_2, Transposase DDE domain	NA|145aa|up_2|CP012670.1_1638295_1638730_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|67aa|up_1|CP012670.1_1638542_1638743_+	NA	NA|100aa|up_0|CP012670.1_1638755_1639055_+	NA	NA|120aa|down_0|CP012670.1_1640002_1640362_+	COG2963, COG2963, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|378aa|down_1|CP012670.1_1640361_1641495_+	pfam00665, rve, Integrase core domain	NA|95aa|down_2|CP012670.1_1641844_1642129_-	NA	NA|575aa|down_3|CP012670.1_1642227_1643952_-	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|535aa|down_4|CP012670.1_1643955_1645560_-	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|798aa|down_5|CP012670.1_1645656_1648050_-	pfam01186, Lysyl_oxidase, Lysyl oxidase	NA|262aa|down_6|CP012670.1_1648155_1648941_+	cd05233, SDR_c, classical (c) SDRs	NA|776aa|down_7|CP012670.1_1648981_1651309_-	pfam06537, DHOR, Di-haem oxidoreductase, putative peroxidase	NA|527aa|down_8|CP012670.1_1651360_1652941_-	pfam07586, HXXSHH, Protein of unknown function (DUF1552)	NA|544aa|down_9|CP012670.1_1652993_1654625_-	pfam07631, PSD4, Protein of unknown function (DUF1592)
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	16	1641529-1641844	8,8,15	CRT,PILER-CR,CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	CTGAANNGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAA	26,36,34	0	0	NA	NA	NA:NA:NA	4,2,3	4	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|152aa|up_9|CP012670.1_1632102_1632558_+,NA|67aa|up_3|CP012670.1_1638542_1638743_+,NA|100aa|up_2|CP012670.1_1638755_1639055_+,NA	NA|152aa|up_9|CP012670.1_1632102_1632558_+	NA	NA|379aa|up_8|CP012670.1_1632557_1633694_+	pfam00665, rve, Integrase core domain	NA|387aa|up_7|CP012670.1_1633736_1634897_-	pfam00665, rve, Integrase core domain	NA|367aa|up_6|CP012670.1_1636747_1637848_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|124aa|up_5|CP012670.1_1637927_1638299_-	pfam13586, DDE_Tnp_1_2, Transposase DDE domain	NA|145aa|up_4|CP012670.1_1638295_1638730_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|67aa|up_3|CP012670.1_1638542_1638743_+	NA	NA|100aa|up_2|CP012670.1_1638755_1639055_+	NA	NA|120aa|up_1|CP012670.1_1640002_1640362_+	COG2963, COG2963, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|378aa|up_0|CP012670.1_1640361_1641495_+	pfam00665, rve, Integrase core domain	NA|575aa|down_0|CP012670.1_1642227_1643952_-	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|535aa|down_1|CP012670.1_1643955_1645560_-	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|798aa|down_2|CP012670.1_1645656_1648050_-	pfam01186, Lysyl_oxidase, Lysyl oxidase	NA|262aa|down_3|CP012670.1_1648155_1648941_+	cd05233, SDR_c, classical (c) SDRs	NA|776aa|down_4|CP012670.1_1648981_1651309_-	pfam06537, DHOR, Di-haem oxidoreductase, putative peroxidase	NA|527aa|down_5|CP012670.1_1651360_1652941_-	pfam07586, HXXSHH, Protein of unknown function (DUF1552)	NA|544aa|down_6|CP012670.1_1652993_1654625_-	pfam07631, PSD4, Protein of unknown function (DUF1592)	NA|301aa|down_7|CP012670.1_1654745_1655648_+	pfam13385, Laminin_G_3, Concanavalin A-like lectin/glucanases superfamily	NA|336aa|down_8|CP012670.1_1655651_1656659_-	PRK07003, PRK07003, DNA polymerase III subunit gamma/tau	NA|322aa|down_9|CP012670.1_1656668_1657634_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	17	1660023-1660128	16	CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	GCGTCTCGGCAGGACGCGGCGCGTCCCC	28	0	0	NA	NA	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA,NA|161aa|down_0|CP012670.1_1660538_1661021_+	NA|798aa|up_9|CP012670.1_1645656_1648050_-	pfam01186, Lysyl_oxidase, Lysyl oxidase	NA|262aa|up_8|CP012670.1_1648155_1648941_+	cd05233, SDR_c, classical (c) SDRs	NA|776aa|up_7|CP012670.1_1648981_1651309_-	pfam06537, DHOR, Di-haem oxidoreductase, putative peroxidase	NA|527aa|up_6|CP012670.1_1651360_1652941_-	pfam07586, HXXSHH, Protein of unknown function (DUF1552)	NA|544aa|up_5|CP012670.1_1652993_1654625_-	pfam07631, PSD4, Protein of unknown function (DUF1592)	NA|301aa|up_4|CP012670.1_1654745_1655648_+	pfam13385, Laminin_G_3, Concanavalin A-like lectin/glucanases superfamily	NA|336aa|up_3|CP012670.1_1655651_1656659_-	PRK07003, PRK07003, DNA polymerase III subunit gamma/tau	NA|322aa|up_2|CP012670.1_1656668_1657634_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|209aa|up_1|CP012670.1_1657626_1658253_-	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|210aa|up_0|CP012670.1_1659306_1659936_+	PRK13406, bchD, magnesium chelatase subunit D; Provisional	NA|161aa|down_0|CP012670.1_1660538_1661021_+	NA	NA|584aa|down_1|CP012670.1_1661237_1662989_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|1368aa|down_2|CP012670.1_1663134_1667238_+	TIGR03903, TOMM_kin_cyc, TOMM system kinase/cyclase fusion protein	NA|287aa|down_3|CP012670.1_1667234_1668095_+	TIGR03795, RNP_Burkhold, ribosomal natural product, two-chain TOMM family	NA|740aa|down_4|CP012670.1_1668120_1670340_+	pfam02624, YcaO, YcaO cyclodehydratase, ATP-ad Mg2+-binding	NA|403aa|down_5|CP012670.1_1670367_1671576_+	pfam04909, Amidohydro_2, Amidohydrolase	NA|283aa|down_6|CP012670.1_1671589_1672438_-	pfam05685, Uma2, Putative restriction endonuclease	NA|562aa|down_7|CP012670.1_1672533_1674219_+	PRK07003, PRK07003, DNA polymerase III subunit gamma/tau	NA|425aa|down_8|CP012670.1_1674237_1675512_-	PRK06958, PRK06958, single-stranded DNA-binding protein; Provisional	NA|349aa|down_9|CP012670.1_1675950_1676997_+	pfam13517, VCBS, Repeat domain in Vibrio, Colwellia, Bradyrhizobium and Shewanella
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	18	2029320-2029428	17	CRISPRCasFinder	no	DEDDh	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Unclear	CCACGCCGCGCGCGCAGCGCTGAGCGCGG	29	0	0	NA	NA	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|377aa|up_7|CP012670.1_2021509_2022640_+,NA|185aa|up_5|CP012670.1_2024277_2024832_+,NA|269aa|up_3|CP012670.1_2025948_2026755_-,NA|222aa|up_2|CP012670.1_2026995_2027661_-,NA|346aa|down_0|CP012670.1_2029513_2030551_-,NA|172aa|down_2|CP012670.1_2031455_2031971_+,NA|537aa|down_7|CP012670.1_2036325_2037936_-	NA|168aa|up_9|CP012670.1_2019303_2019807_+	pfam14424, Toxin-deaminase, The BURPS668_1122 family of deaminases	NA|462aa|up_8|CP012670.1_2019953_2021339_-	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|377aa|up_7|CP012670.1_2021509_2022640_+	NA	NA|456aa|up_6|CP012670.1_2022776_2024144_+	cd12953, MMP_TTHA0227, Minimal MMP-like domain found in Thermus thermophilus hypothetical protein TTHA0227 and similar proteins	NA|185aa|up_5|CP012670.1_2024277_2024832_+	NA	NA|345aa|up_4|CP012670.1_2024865_2025900_+	PRK00059, prsA, peptidylprolyl isomerase; Provisional	NA|269aa|up_3|CP012670.1_2025948_2026755_-	NA	NA|222aa|up_2|CP012670.1_2026995_2027661_-	NA	NA|98aa|up_1|CP012670.1_2027766_2028060_+	pfam01329, Pterin_4a, Pterin 4 alpha carbinolamine dehydratase	NA|280aa|up_0|CP012670.1_2028086_2028926_+	PRK00102, rnc, ribonuclease III; Reviewed	NA|346aa|down_0|CP012670.1_2029513_2030551_-	NA	NA|173aa|down_1|CP012670.1_2030843_2031362_+	cd18094, SpoU-like_TrmL, SAM-dependent tRNA methylase related to TrmL	NA|172aa|down_2|CP012670.1_2031455_2031971_+	NA	NA|169aa|down_3|CP012670.1_2032156_2032663_-	COG1329, COG1329, Transcriptional regulators, similar to M	NA|312aa|down_4|CP012670.1_2033296_2034232_-	COG0248, GppA, Exopolyphosphatase [Nucleotide transport and metabolism / Inorganic ion transport and metabolism]	NA|275aa|down_5|CP012670.1_2034242_2035067_-	COG1360, MotB, Flagellar motor protein [Cell motility and secretion]	NA|125aa|down_6|CP012670.1_2035548_2035923_+	PRK14900, valS, valyl-tRNA synthetase; Provisional	NA|537aa|down_7|CP012670.1_2036325_2037936_-	NA	NA|215aa|down_8|CP012670.1_2037964_2038609_-	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|222aa|down_9|CP012670.1_2038833_2039499_-	COG1403, McrA, Restriction endonuclease [Defense mechanisms]
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	19	2162194-2162290	18	CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	CGCGGGCCGCGCCGCGGTCAGCGCGGGCCGCGC	33	0	0	NA	NA	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA,NA|152aa|down_3|CP012670.1_2166754_2167210_+	NA|481aa|up_9|CP012670.1_2149153_2150596_-	TIGR01444, 2-O-methyltransferase_NoeI, methyltransferase, FkbM family	NA|425aa|up_8|CP012670.1_2150612_2151887_-	TIGR02427, b-ketoadipate_enol-lactone_hydrolase, 3-oxoadipate enol-lactonase	NA|124aa|up_7|CP012670.1_2151855_2152227_-	cd02226, cupin_YdbB-like, Bacillus subtilis YdbB and related proteins, cupin domain	NA|330aa|up_6|CP012670.1_2152223_2153213_-	TIGR01762, syringomycin_biosynthesis_enzyme_2, chlorinating enzyme	NA|633aa|up_5|CP012670.1_2153209_2155108_-	cd17643, A_NRPS_Cytc1-like, similar to adenylation domain of cytotrienin synthetase CytC1	NA|211aa|up_4|CP012670.1_2156040_2156673_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|341aa|up_3|CP012670.1_2156819_2157842_-	PRK12297, obgE, GTPase CgtA; Reviewed	NA|499aa|up_2|CP012670.1_2157938_2159435_+	PRK11360, PRK11360, two-component system sensor histidine kinase AtoS	NA|228aa|up_1|CP012670.1_2159887_2160571_-	PRK05306, infB, translation initiation factor IF-2; Validated	NA|334aa|up_0|CP012670.1_2161188_2162190_+	COG1597, LCB5, Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase [Lipid metabolism / General function prediction only]	NA|438aa|down_0|CP012670.1_2162295_2163609_-	pfam07994, NAD_binding_5, Myo-inositol-1-phosphate synthase	NA|405aa|down_1|CP012670.1_2163908_2165123_-	TIGR03806, chp_HNE_0200, conserved hypothetical protein, HNE_0200 family	NA|473aa|down_2|CP012670.1_2165135_2166554_+	pfam07995, GSDH, Glucose / Sorbosone dehydrogenase	NA|152aa|down_3|CP012670.1_2166754_2167210_+	NA	NA|206aa|down_4|CP012670.1_2167206_2167824_+	cd18737, PIN_VapC4-5_FitB-like, uncharacterized subgroup of the PIN_VapC4-5_FitB-like subfamily of the PIN domain superfamily	NA|892aa|down_5|CP012670.1_2168304_2170980_+	PRK13800, PRK13800, fumarate reductase/succinate dehydrogenase flavoprotein subunit	NA|587aa|down_6|CP012670.1_2171004_2172765_+	pfam00339, Arrestin_N, Arrestin (or S-antigen), N-terminal domain	NA|109aa|down_7|CP012670.1_2172870_2173197_+	cd07177, terB_like, tellurium resistance terB-like protein	NA|185aa|down_8|CP012670.1_2173253_2173808_+	pfam00731, AIRC, AIR carboxylase	NA|397aa|down_9|CP012670.1_2173927_2175118_+	PRK06019, PRK06019, phosphoribosylaminoimidazole carboxylase ATPase subunit; Reviewed
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	21	3388196-3388272	20	CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	ACGCCGTGCGCGCCCGCCCGCCCG	24	0	0	NA	NA	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|69aa|up_2|CP012670.1_3386336_3386543_+,NA|414aa|down_3|CP012670.1_3391918_3393160_+,NA|79aa|down_4|CP012670.1_3393190_3393427_+,NA|153aa|down_5|CP012670.1_3393330_3393789_+	NA|239aa|up_9|CP012670.1_3374009_3374726_-	pfam01863, DUF45, Protein of unknown function DUF45	NA|1084aa|up_8|CP012670.1_3374722_3377974_-	COG0610, COG0610, Type I site-specific restriction-modification system, R (restriction) subunit and related helicases [Defense mechanisms]	NA|375aa|up_7|CP012670.1_3377982_3379107_-	pfam13310, Virulence_RhuM, Virulence protein RhuM family	NA|280aa|up_6|CP012670.1_3379103_3379943_-	cd17524, RMtype1_S_EcoUTORF5051P-TRD2-CR2_like, Type I restriction-modification system specificity (S) subunit TRD-CR, similar to Escherichia coli UTI89 S subunit (S	NA|800aa|up_5|CP012670.1_3380406_3382806_-	COG0286, HsdM, Type I restriction-modification system methyltransferase subunit [Defense mechanisms]	NA|316aa|up_4|CP012670.1_3383163_3384111_+	pfam10446, DUF2457, Protein of unknown function (DUF2457)	NA|388aa|up_3|CP012670.1_3384513_3385677_-	pfam13683, rve_3, Integrase core domain	NA|69aa|up_2|CP012670.1_3386336_3386543_+	NA	NA|228aa|up_1|CP012670.1_3386590_3387274_+	pfam16554, OAM_dimer, dimerization domain of d-ornithine 4,5-aminomutase	NA|286aa|up_0|CP012670.1_3387277_3388135_-	smart00271, DnaJ, DnaJ molecular chaperone homology domain	NA|282aa|down_0|CP012670.1_3388421_3389267_+	cd05353, hydroxyacyl-CoA-like_DH_SDR_c-like, (3R)-hydroxyacyl-CoA dehydrogenase-like, classical(c)-like SDRs	NA|575aa|down_1|CP012670.1_3389628_3391353_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|168aa|down_2|CP012670.1_3391397_3391901_+	cd17036, T3SC_YbjN-like_1, T110839 is structurally similar to type III secretion system chaperones and YbjN family proteins	NA|414aa|down_3|CP012670.1_3391918_3393160_+	NA	NA|79aa|down_4|CP012670.1_3393190_3393427_+	NA	NA|153aa|down_5|CP012670.1_3393330_3393789_+	NA	NA|333aa|down_6|CP012670.1_3393791_3394790_+	cd13957, PT_UbiA_Cox10, Protoheme IX farnesyltransferase	NA|238aa|down_7|CP012670.1_3394840_3395554_+	cd02968, SCO, SCO (an acronym for Synthesis of Cytochrome c Oxidase) family; composed of proteins similar to Sco1, a membrane-anchored protein possessing a soluble domain with a TRX fold	NA|1502aa|down_8|CP012670.1_3395629_3400135_-	COG3604, FhlA, Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains [Transcription / Signal transduction mechanisms]	NA|265aa|down_9|CP012670.1_3400176_3400971_-	pfam11303, DUF3105, Protein of unknown function (DUF3105)
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	23	3693312-3693413	21	CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	CGCGTCATCGCGGCTCGTGATGACC	25	0	0	NA	NA	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|298aa|up_9|CP012670.1_3677297_3678191_+,NA|194aa|up_5|CP012670.1_3685747_3686329_+,NA|145aa|up_3|CP012670.1_3687792_3688227_-,NA|373aa|up_1|CP012670.1_3690444_3691563_+,NA|412aa|up_0|CP012670.1_3691686_3692922_-,NA|574aa|down_4|CP012670.1_3701816_3703538_+	NA|298aa|up_9|CP012670.1_3677297_3678191_+	NA	NA|742aa|up_8|CP012670.1_3678411_3680637_+	TIGR03361, VI_Rhs_Vgr, type VI secretion system Vgr family protein	NA|757aa|up_7|CP012670.1_3680665_3682936_+	TIGR03361, VI_Rhs_Vgr, type VI secretion system Vgr family protein	NA|932aa|up_6|CP012670.1_3682952_3685748_+	TIGR03361, VI_Rhs_Vgr, type VI secretion system Vgr family protein	NA|194aa|up_5|CP012670.1_3685747_3686329_+	NA	NA|439aa|up_4|CP012670.1_3686394_3687711_-	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|145aa|up_3|CP012670.1_3687792_3688227_-	NA	NA|700aa|up_2|CP012670.1_3688289_3690389_+	TIGR02270, hypothetical_protein_GSU3180, conserved hypothetical protein	NA|373aa|up_1|CP012670.1_3690444_3691563_+	NA	NA|412aa|up_0|CP012670.1_3691686_3692922_-	NA	NA|245aa|down_0|CP012670.1_3693419_3694154_-	smart00935, OmpH, Outer membrane protein (OmpH-like)	NA|1336aa|down_1|CP012670.1_3694187_3698195_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|340aa|down_2|CP012670.1_3698383_3699403_-	pfam05308, Mito_fiss_reg, Mitochondrial fission regulator	NA|436aa|down_3|CP012670.1_3699907_3701215_-	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|574aa|down_4|CP012670.1_3701816_3703538_+	NA	NA|1010aa|down_5|CP012670.1_3703534_3706564_+	PRK12323, PRK12323, DNA polymerase III subunit gamma/tau	NA|909aa|down_6|CP012670.1_3706681_3709408_+	PRK05399, PRK05399, DNA mismatch repair protein MutS; Provisional	NA|292aa|down_7|CP012670.1_3709656_3710532_+	PRK05299, rpsB, 30S ribosomal protein S2; Provisional	NA|316aa|down_8|CP012670.1_3710804_3711752_+	PRK09377, tsf, elongation factor Ts; Provisional	NA|315aa|down_9|CP012670.1_3711761_3712706_+	cd10940, CE4_PuuE_HpPgdA_like_1, Putative catalytic domain of uncharacterized bacterial polysaccharide deacetylases similar to bacterial PuuE allantoinases and Helicobacter pylori peptidoglycan deacetylase (HpPgdA)
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	24	4263290-4263411	22	CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	GAGCGTCACATCAACTCCAGATTCCGATGAGCGCAGCGCGCTT	43	0	0	NA	NA	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|261aa|up_1|CP012670.1_4260922_4261705_+,NA|161aa|down_0|CP012670.1_4263591_4264074_+,NA|839aa|down_6|CP012670.1_4269274_4271791_-,NA|143aa|down_9|CP012670.1_4273503_4273932_-	NA|332aa|up_9|CP012670.1_4244177_4245173_+	pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen	NA|606aa|up_8|CP012670.1_4245591_4247409_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|637aa|up_7|CP012670.1_4247443_4249354_+	pfam08308, PEGA, PEGA domain	NA|1176aa|up_6|CP012670.1_4249477_4253005_-	sd00038, Kelch, Kelch repeat	NA|938aa|up_5|CP012670.1_4253240_4256054_+	sd00002, TSP3, Calcium-binding Thrombospondin type 3 (TSP3) repeat	NA|601aa|up_4|CP012670.1_4256327_4258130_+	cd07185, OmpA_C-like, Peptidoglycan binding domains similar to the C-terminal domain of outer-membrane protein OmpA	NA|504aa|up_3|CP012670.1_4258353_4259865_+	cd07398, MPP_YbbF-LpxH, Escherichia coli YbbF/LpxH and related proteins, metallophosphatase domain	NA|259aa|up_2|CP012670.1_4259979_4260756_+	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|261aa|up_1|CP012670.1_4260922_4261705_+	NA	NA|504aa|up_0|CP012670.1_4261715_4263227_-	PTZ00146, PTZ00146, fibrillarin; Provisional	NA|161aa|down_0|CP012670.1_4263591_4264074_+	NA	NA|620aa|down_1|CP012670.1_4264334_4266194_-	cd01153, ACAD_fadE5, Putative acyl-CoA dehydrogenases similar to fadE5	NA|158aa|down_2|CP012670.1_4266475_4266949_+	PRK01885, greB, transcription elongation factor GreB; Reviewed	NA|132aa|down_3|CP012670.1_4266992_4267388_+	TIGR00068, Lactoylglutathione_lyase, lactoylglutathione lyase	NA|305aa|down_4|CP012670.1_4267566_4268481_-	PRK11855, PRK11855, dihydrolipoamide acetyltransferase; Reviewed	NA|103aa|down_5|CP012670.1_4268814_4269123_+	COG2350, COG2350, Uncharacterized protein conserved in bacteria [Function unknown]	NA|839aa|down_6|CP012670.1_4269274_4271791_-	NA	NA|203aa|down_7|CP012670.1_4272257_4272866_+	pfam09536, DUF2378, Protein of unknown function (DUF2378)	NA|206aa|down_8|CP012670.1_4272955_4273573_+	cd02253, DmpA, L-Aminopeptidase D-amidase/D-esterase (DmpA) family; DmpA catalyzes the release of N-terminal D and L amino acids from peptide susbtrates	NA|143aa|down_9|CP012670.1_4273503_4273932_-	NA
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	25	4412444-4412656	23	CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	GGCACCCTGGCCGCGCAGGAGGGCCTGCGGGGCTGCGCAGGA	42	0	0	NA	NA	NA	2	2	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|67aa|up_9|CP012670.1_4402619_4402820_-,NA|125aa|up_0|CP012670.1_4411663_4412038_+,NA|233aa|down_4|CP012670.1_4417869_4418568_-,NA|76aa|down_8|CP012670.1_4423611_4423839_-	NA|67aa|up_9|CP012670.1_4402619_4402820_-	NA	NA|145aa|up_8|CP012670.1_4402632_4403067_+	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|124aa|up_7|CP012670.1_4403063_4403435_+	pfam13586, DDE_Tnp_1_2, Transposase DDE domain	NA|261aa|up_6|CP012670.1_4403428_4404211_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|217aa|up_5|CP012670.1_4404762_4405413_+	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|328aa|up_4|CP012670.1_4405500_4406484_-	COG0583, LysR, Transcriptional regulator [Transcription]	NA|166aa|up_3|CP012670.1_4406597_4407095_+	pfam14079, DUF4260, Domain of unknown function (DUF4260)	NA|774aa|up_2|CP012670.1_4407278_4409600_-	cd18817, GH43f_LbAraf43-like, Glycosyl hydrolase family 43 such as Lactobacillus brevis alpha-L-arabinofuranosidase LbAraf43	NA|519aa|up_1|CP012670.1_4409648_4411205_-	cd08983, GH43_Bt3655-like, Glycosyl hydrolase family 43 protein such as Bacteroides thetaiotaomicron VPI-5482 arabinofuranosidase Bt3655	NA|125aa|up_0|CP012670.1_4411663_4412038_+	NA	NA|203aa|down_0|CP012670.1_4412896_4413505_+	COG4832, COG4832, Uncharacterized conserved protein [Function unknown]	NA|266aa|down_1|CP012670.1_4413745_4414543_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|189aa|down_2|CP012670.1_4415357_4415924_+	PRK06203, aroB, 3-dehydroquinate synthase; Reviewed	NA|585aa|down_3|CP012670.1_4416066_4417821_-	pfam00339, Arrestin_N, Arrestin (or S-antigen), N-terminal domain	NA|233aa|down_4|CP012670.1_4417869_4418568_-	NA	NA|399aa|down_5|CP012670.1_4418564_4419761_-	TIGR03696, tRNA_nuclease_WapA, RHS repeat-associated core domain	NA|473aa|down_6|CP012670.1_4420412_4421831_+	pfam01548, DEDD_Tnp_IS110, Transposase	NA|496aa|down_7|CP012670.1_4421855_4423343_-	pfam05593, RHS_repeat, RHS Repeat	NA|76aa|down_8|CP012670.1_4423611_4423839_-	NA	NA|333aa|down_9|CP012670.1_4424171_4425170_+	pfam05158, RNA_pol_Rpc34, RNA polymerase Rpc34 subunit
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	28	4814657-4814754	9	PILER-CR	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	ACGACG----GCACGTCGTCTG	22	0	0	NA	NA	NA	2	2	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|301aa|up_8|CP012670.1_4800962_4801865_-,NA|169aa|up_7|CP012670.1_4801945_4802452_+,NA|161aa|up_6|CP012670.1_4802448_4802931_+,NA|844aa|up_3|CP012670.1_4806716_4809248_-,NA|277aa|down_1|CP012670.1_4815128_4815959_-,NA|185aa|down_4|CP012670.1_4819912_4820467_-,NA|103aa|down_5|CP012670.1_4820764_4821073_+,NA|222aa|down_9|CP012670.1_4824836_4825502_-	NA|416aa|up_9|CP012670.1_4799644_4800892_-	COG1194, MutY, A/G-specific DNA glycosylase [DNA replication, recombination, and repair]	NA|301aa|up_8|CP012670.1_4800962_4801865_-	NA	NA|169aa|up_7|CP012670.1_4801945_4802452_+	NA	NA|161aa|up_6|CP012670.1_4802448_4802931_+	NA	NA|414aa|up_5|CP012670.1_4802996_4804238_+	pfam09992, NAGPA, Phosphodiester glycosidase	NA|763aa|up_4|CP012670.1_4804423_4806712_-	PRK13557, PRK13557, histidine kinase; Provisional	NA|844aa|up_3|CP012670.1_4806716_4809248_-	NA	NA|511aa|up_2|CP012670.1_4809461_4810994_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|361aa|up_1|CP012670.1_4811221_4812304_-	sd00006, TPR, Tetratricopeptide repeat	NA|639aa|up_0|CP012670.1_4812517_4814434_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|87aa|down_0|CP012670.1_4814904_4815165_+	COG2261, COG2261, Predicted membrane protein [Function unknown]	NA|277aa|down_1|CP012670.1_4815128_4815959_-	NA	NA|443aa|down_2|CP012670.1_4816257_4817586_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|483aa|down_3|CP012670.1_4818002_4819451_-	cd13128, MATE_Wzx_like, Wzx, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins	NA|185aa|down_4|CP012670.1_4819912_4820467_-	NA	NA|103aa|down_5|CP012670.1_4820764_4821073_+	NA	NA|220aa|down_6|CP012670.1_4821462_4822122_-	PLN02434, PLN02434, fatty acid hydroxylase	NA|383aa|down_7|CP012670.1_4822242_4823391_-	cd14861, Fe-ADH-like, Iron-containing alcohol dehydrogenases-like	NA|454aa|down_8|CP012670.1_4823393_4824755_-	COG0174, GlnA, Glutamine synthetase [Amino acid transport and metabolism]	NA|222aa|down_9|CP012670.1_4824836_4825502_-	NA
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	29	5229701-5229817	26	CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	CCCCCTGGCAGCTCCCGTCTCCGGGGGGTGG	31	0	0	NA	NA	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|111aa|up_7|CP012670.1_5220560_5220893_-,NA|225aa|up_4|CP012670.1_5223758_5224433_+,NA|70aa|down_0|CP012670.1_5229970_5230180_-,NA|158aa|down_1|CP012670.1_5230230_5230704_+,NA|129aa|down_2|CP012670.1_5232354_5232741_+,NA|177aa|down_5|CP012670.1_5234585_5235116_+,NA|354aa|down_6|CP012670.1_5235145_5236207_+,NA|246aa|down_8|CP012670.1_5237484_5238222_-	NA|197aa|up_9|CP012670.1_5219288_5219879_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|207aa|up_8|CP012670.1_5219922_5220543_-	cd00564, TMP_TenI, Thiamine monophosphate synthase (TMP synthase)/TenI	NA|111aa|up_7|CP012670.1_5220560_5220893_-	NA	NA|428aa|up_6|CP012670.1_5220961_5222245_-	cd03798, GT4_WlbH-like, Bordetella parapertussis WlbH and similar proteins	NA|261aa|up_5|CP012670.1_5222842_5223625_-	PRK00208, thiG, thiazole synthase; Reviewed	NA|225aa|up_4|CP012670.1_5223758_5224433_+	NA	NA|320aa|up_3|CP012670.1_5224419_5225379_+	pfam00413, Peptidase_M10, Matrixin	NA|373aa|up_2|CP012670.1_5225468_5226587_+	cd06142, RNaseD_exo, DEDDy 3'-5' exonuclease domain of Ribonuclease D and similar proteins	NA|559aa|up_1|CP012670.1_5226708_5228385_+	pfam00924, MS_channel, Mechanosensitive ion channel	NA|358aa|up_0|CP012670.1_5228424_5229498_-	TIGR00433, biotin_synthase, biotin synthase	NA|70aa|down_0|CP012670.1_5229970_5230180_-	NA	NA|158aa|down_1|CP012670.1_5230230_5230704_+	NA	NA|129aa|down_2|CP012670.1_5232354_5232741_+	NA	NA|378aa|down_3|CP012670.1_5232760_5233894_-	pfam00665, rve, Integrase core domain	NA|120aa|down_4|CP012670.1_5233893_5234253_-	COG2963, COG2963, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|177aa|down_5|CP012670.1_5234585_5235116_+	NA	NA|354aa|down_6|CP012670.1_5235145_5236207_+	NA	NA|351aa|down_7|CP012670.1_5236284_5237337_-	COG2805, PilT, Tfp pilus assembly protein, pilus retraction ATPase PilT [Cell motility and secretion / Intracellular trafficking and secretion]	NA|246aa|down_8|CP012670.1_5237484_5238222_-	NA	NA|465aa|down_9|CP012670.1_5238218_5239613_-	pfam01964, ThiC_Rad_SAM, Radical SAM ThiC family
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	30	5956158-5956257	27	CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	GGTGTGGGGGACGGGCCGATGTC	23	0	0	NA	NA	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|147aa|up_8|CP012670.1_5948116_5948557_-,NA|57aa|up_0|CP012670.1_5955710_5955881_-,NA|72aa|down_1|CP012670.1_5958132_5958348_+,NA|494aa|down_4|CP012670.1_5966703_5968185_-,NA|229aa|down_5|CP012670.1_5968184_5968871_-,NA|347aa|down_6|CP012670.1_5969144_5970185_+	NA|345aa|up_9|CP012670.1_5946978_5948013_-	TIGR04180, NAD-dependent_epimerase/dehydratase, NAD dependent epimerase/dehydratase, LLPSF_EDH_00030 family	NA|147aa|up_8|CP012670.1_5948116_5948557_-	NA	NA|287aa|up_7|CP012670.1_5948669_5949530_+	COG3568, ElsH, Metal-dependent hydrolase [General function prediction only]	NA|332aa|up_6|CP012670.1_5949564_5950560_+	cd01561, CBS_like, CBS_like: This subgroup includes Cystathionine beta-synthase (CBS) and Cysteine synthase	NA|285aa|up_5|CP012670.1_5950624_5951479_-	PRK09562, mazG, nucleoside triphosphate pyrophosphohydrolase; Reviewed	NA|481aa|up_4|CP012670.1_5951831_5953274_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|148aa|up_3|CP012670.1_5953648_5954092_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|129aa|up_2|CP012670.1_5954146_5954533_+	cd07263, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|304aa|up_1|CP012670.1_5954714_5955626_-	TIGR04247, nitrous_oxide_maturation_protein_NosD, nitrous oxide reductase family maturation protein NosD	NA|57aa|up_0|CP012670.1_5955710_5955881_-	NA	NA|515aa|down_0|CP012670.1_5956436_5957981_-	PRK07107, PRK07107, IMP dehydrogenase	NA|72aa|down_1|CP012670.1_5958132_5958348_+	NA	NA|2131aa|down_2|CP012670.1_5958266_5964659_-	TIGR02148, ORFveg106_random, fibro-slime domain	NA|568aa|down_3|CP012670.1_5965039_5966743_+	cd00657, Ferritin_like, Ferritin-like superfamily of diiron-containing four-helix-bundle proteins	NA|494aa|down_4|CP012670.1_5966703_5968185_-	NA	NA|229aa|down_5|CP012670.1_5968184_5968871_-	NA	NA|347aa|down_6|CP012670.1_5969144_5970185_+	NA	NA|180aa|down_7|CP012670.1_5970227_5970767_-	pfam00498, FHA, FHA domain	NA|714aa|down_8|CP012670.1_5971025_5973167_+	pfam03512, Glyco_hydro_52, Glycosyl hydrolase family 52	NA|352aa|down_9|CP012670.1_5973176_5974232_-	cd07041, STAS_RsbR_RsbS_like, Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	31	5976613-5976702	28	CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	TGCCCGGACCCGACCCAGGTTGCCCGG	27	0	0	NA	NA	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|494aa|up_7|CP012670.1_5966703_5968185_-,NA|229aa|up_6|CP012670.1_5968184_5968871_-,NA|347aa|up_5|CP012670.1_5969144_5970185_+,NA|161aa|down_2|CP012670.1_5978695_5979178_-,NA|209aa|down_9|CP012670.1_5989250_5989877_-	NA|2131aa|up_9|CP012670.1_5958266_5964659_-	TIGR02148, ORFveg106_random, fibro-slime domain	NA|568aa|up_8|CP012670.1_5965039_5966743_+	cd00657, Ferritin_like, Ferritin-like superfamily of diiron-containing four-helix-bundle proteins	NA|494aa|up_7|CP012670.1_5966703_5968185_-	NA	NA|229aa|up_6|CP012670.1_5968184_5968871_-	NA	NA|347aa|up_5|CP012670.1_5969144_5970185_+	NA	NA|180aa|up_4|CP012670.1_5970227_5970767_-	pfam00498, FHA, FHA domain	NA|714aa|up_3|CP012670.1_5971025_5973167_+	pfam03512, Glyco_hydro_52, Glycosyl hydrolase family 52	NA|352aa|up_2|CP012670.1_5973176_5974232_-	cd07041, STAS_RsbR_RsbS_like, Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation	NA|253aa|up_1|CP012670.1_5974683_5975442_-	cd05233, SDR_c, classical (c) SDRs	NA|350aa|up_0|CP012670.1_5975518_5976568_-	cd08174, G1PDH-like, Glycerol-1-phosphate dehydrogenase-like	NA|404aa|down_0|CP012670.1_5976725_5977937_-	cd07041, STAS_RsbR_RsbS_like, Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation	NA|207aa|down_1|CP012670.1_5978075_5978696_-	pfam13453, zf-TFIIB, Transcription factor zinc-finger	NA|161aa|down_2|CP012670.1_5978695_5979178_-	NA	NA|401aa|down_3|CP012670.1_5979885_5981088_-	PRK10535, PRK10535, macrolide ABC transporter ATP-binding protein/permease MacB	NA|244aa|down_4|CP012670.1_5981081_5981813_-	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|412aa|down_5|CP012670.1_5981814_5983050_-	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|511aa|down_6|CP012670.1_5983046_5984579_-	COG1538, TolC, Outer membrane protein [Cell envelope biogenesis, outer membrane / Intracellular trafficking and secretion]	NA|237aa|down_7|CP012670.1_5984651_5985362_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|451aa|down_8|CP012670.1_5985358_5986711_+	PRK09470, cpxA, envelope stress sensor histidine kinase CpxA	NA|209aa|down_9|CP012670.1_5989250_5989877_-	NA
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	33	6118216-6118295	30	CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	CCGGGCGCCCCGGCGCTCTACCC	23	0	0	NA	NA	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA,NA|154aa|down_4|CP012670.1_6130495_6130957_+,NA|120aa|down_5|CP012670.1_6132331_6132691_+,NA|238aa|down_6|CP012670.1_6132871_6133585_+,NA|285aa|down_7|CP012670.1_6133586_6134441_+	NA|138aa|up_9|CP012670.1_6107427_6107841_-	cd14773, TrHb2_PhHbO-like_O, Truncated hemoglobins, group 2 (O); Pseudoalteromonas haloplanktis PhHbO like	NA|196aa|up_8|CP012670.1_6107973_6108561_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|312aa|up_7|CP012670.1_6108660_6109596_+	cd08422, PBP2_CrgA_like, The C-terminal substrate binding domain of LysR-type transcriptional regulator CrgA and its related homologs, contains the type 2 periplasmic binding domain	NA|292aa|up_6|CP012670.1_6109656_6110532_-	cd05231, NmrA_TMR_like_1_SDR_a, NmrA (a transcriptional regulator) and triphenylmethane reductase (TMR) like proteins, subgroup 1, atypical (a) SDRs	NA|432aa|up_5|CP012670.1_6110920_6112216_+	smart00020, Tryp_SPc, Trypsin-like serine protease	NA|390aa|up_4|CP012670.1_6112389_6113559_-	PTZ00146, PTZ00146, fibrillarin; Provisional	NA|197aa|up_3|CP012670.1_6113672_6114263_-	TIGR00730, LOG_family_protein_YJL055W, TIGR00730 family protein	NA|495aa|up_2|CP012670.1_6114348_6115833_-	TIGR03355, VI_chp_2, type VI secretion protein, EvpB/VC_A0108 family	NA|164aa|up_1|CP012670.1_6115836_6116328_-	pfam05591, T6SS_VipA, Type VI secretion system, VipA, VC_A0107 or Hcp2	NA|562aa|up_0|CP012670.1_6116436_6118122_-	pfam16989, T6SS_VasJ, Type VI secretion, EvfE, EvfF, ImpA, BimE, VC_A0119, VasJ	NA|1234aa|down_0|CP012670.1_6119904_6123606_-	TIGR03348, VI_IcmF, type VI secretion protein IcmF	NA|226aa|down_1|CP012670.1_6123648_6124326_-	pfam09850, DotU, Type VI secretion system protein DotU	NA|453aa|down_2|CP012670.1_6124360_6125719_-	pfam05936, T6SS_VasE, Bacterial Type VI secretion, VC_A0110, EvfL, ImpJ, VasE	NA|209aa|down_3|CP012670.1_6125884_6126511_-	pfam12790, T6SS-SciN, Type VI secretion lipoprotein, VasD, EvfM, TssJ, VC_A0113	NA|154aa|down_4|CP012670.1_6130495_6130957_+	NA	NA|120aa|down_5|CP012670.1_6132331_6132691_+	NA	NA|238aa|down_6|CP012670.1_6132871_6133585_+	NA	NA|285aa|down_7|CP012670.1_6133586_6134441_+	NA	NA|841aa|down_8|CP012670.1_6134477_6137000_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|302aa|down_9|CP012670.1_6136921_6137827_-	PRK07003, PRK07003, DNA polymerase III subunit gamma/tau
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	35	6263493-6263621	32	CRISPRCasFinder	no	WYL	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Unclear	CCCACGTTGAGCCGAGGCAGCCCGCCGCCGCCGGGGT	37	0	0	NA	NA	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA,NA	NA|471aa|up_9|CP012670.1_6251857_6253270_-	PLN03138, PLN03138, Protein TOC75; Provisional	NA|983aa|up_8|CP012670.1_6253350_6256299_-	TIGR01782, TonB-dependent_receptor, TonB-dependent receptor	NA|146aa|up_7|CP012670.1_6256400_6256838_-	COG0848, ExbD, Biopolymer transport protein [Intracellular trafficking and secretion]	NA|153aa|up_6|CP012670.1_6256842_6257301_-	COG0848, ExbD, Biopolymer transport protein [Intracellular trafficking and secretion]	NA|236aa|up_5|CP012670.1_6257254_6257962_-	TIGR02796, Protein_TolQ, TolQ protein	NA|261aa|up_4|CP012670.1_6258021_6258804_-	pfam03544, TonB_C, Gram-negative bacterial TonB protein C-terminal	NA|236aa|up_3|CP012670.1_6258977_6259685_-	TIGR02154, PhoB, phosphate regulon transcriptional regulatory protein PhoB	NA|199aa|up_2|CP012670.1_6259811_6260408_+	COG1045, CysE, Serine acetyltransferase [Amino acid transport and metabolism]	NA|575aa|up_1|CP012670.1_6260421_6262146_-	pfam13367, PrsW-protease, Protease prsW family	NA|230aa|up_0|CP012670.1_6262552_6263242_-	cd16325, LolA, LolA, a periplasmic chaperone	NA|180aa|down_0|CP012670.1_6263636_6264176_-	PRK00039, ruvC, Holliday junction resolvase; Reviewed	NA|503aa|down_1|CP012670.1_6264180_6265689_-	TIGR00387, Glycolate_oxidase_subunit_glcD	NA|765aa|down_2|CP012670.1_6265842_6268137_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|65aa|down_3|CP012670.1_6268177_6268372_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|196aa|down_4|CP012670.1_6268685_6269273_-	cd00229, SGNH_hydrolase, SGNH_hydrolase, or GDSL_hydrolase, is a diverse family of lipases and esterases	NA|152aa|down_5|CP012670.1_6270103_6270559_-	cd01109, HTH_YyaN, Helix-Turn-Helix DNA binding domain of the MerR-like transcription regulators YyaN and YraB	NA|347aa|down_6|CP012670.1_6270555_6271596_-	cd19076, AKR_AKR13A_13D, AKR13A and AKR13D families of aldo-keto reductase (AKR)	NA|152aa|down_7|CP012670.1_6271688_6272144_-	COG3791, COG3791, Uncharacterized conserved protein [Function unknown]	WYL|374aa|down_8|CP012670.1_6272161_6273283_+	COG2378, COG2378, Predicted transcriptional regulator [Transcription]	NA|608aa|down_9|CP012670.1_6273399_6275223_+	PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	36	6689636-6689747	33	CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	TGGGGCCGTGACGAGGGGAGATCTGAGCGACGCGAA	36	0	0	NA	NA	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|144aa|up_9|CP012670.1_6680606_6681038_+,NA|61aa|up_5|CP012670.1_6683860_6684043_-,NA|327aa|up_4|CP012670.1_6684042_6685023_-,NA|213aa|down_3|CP012670.1_6694630_6695269_-,NA|78aa|down_8|CP012670.1_6700324_6700558_+	NA|144aa|up_9|CP012670.1_6680606_6681038_+	NA	NA|425aa|up_8|CP012670.1_6681200_6682475_-	COG3405, CelA, Endoglucanase Y [Carbohydrate transport and metabolism]	NA|258aa|up_7|CP012670.1_6682482_6683256_-	PRK07003, PRK07003, DNA polymerase III subunit gamma/tau	NA|209aa|up_6|CP012670.1_6683252_6683879_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|61aa|up_5|CP012670.1_6683860_6684043_-	NA	NA|327aa|up_4|CP012670.1_6684042_6685023_-	NA	NA|216aa|up_3|CP012670.1_6685155_6685803_-	COG0637, COG0637, Predicted phosphatase/phosphohexomutase [General function prediction only]	NA|261aa|up_2|CP012670.1_6685909_6686692_+	COG0600, TauC, ABC-type nitrate/sulfonate/bicarbonate transport system, permease component [Inorganic ion transport and metabolism]	NA|353aa|up_1|CP012670.1_6686691_6687750_+	pfam09084, NMT1, NMT1/THI5 like	NA|554aa|up_0|CP012670.1_6687838_6689500_+	COG5184, ATS1, Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton]	NA|378aa|down_0|CP012670.1_6690150_6691284_+	cd05819, NHL, NHL repeat unit of beta-propeller proteins	NA|799aa|down_1|CP012670.1_6691315_6693712_-	pfam07090, GATase1_like, Putative glutamine amidotransferase	NA|276aa|down_2|CP012670.1_6693794_6694622_-	pfam13709, DUF4159, Domain of unknown function (DUF4159)	NA|213aa|down_3|CP012670.1_6694630_6695269_-	NA	NA|489aa|down_4|CP012670.1_6695409_6696876_+	COG0286, HsdM, Type I restriction-modification system methyltransferase subunit [Defense mechanisms]	NA|444aa|down_5|CP012670.1_6696872_6698204_+	cd17262, RMtype1_S_Aco12261I-TRD2-CR2, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to Aminobacterium colombiense DSM 12261 S subunit (S	NA|224aa|down_6|CP012670.1_6698200_6698872_-	pfam03070, TENA_THI-4, TENA/THI-4/PQQC family	NA|346aa|down_7|CP012670.1_6699167_6700205_+	pfam15967, Nucleoporin_FG2, Nucleoporin FG repeated region	NA|78aa|down_8|CP012670.1_6700324_6700558_+	NA	NA|554aa|down_9|CP012670.1_6700665_6702327_+	cd09140, PLDc_vPLD1_2_like_bac_1, Catalytic domain, repeat 1, of uncharacterized bacterial proteins with similarity to vertebrate phospholipases, PLD1 and PLD2
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	38	7119736-7119850	35	CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	CCGGTGAGCCCCCAGGGGCCCCACCACCTCCCCGGGG	37	0	0	NA	NA	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|174aa|up_6|CP012670.1_7112379_7112901_-,NA|245aa|down_0|CP012670.1_7119885_7120620_-,NA|494aa|down_1|CP012670.1_7120733_7122215_+,NA|195aa|down_4|CP012670.1_7124337_7124922_+,NA|183aa|down_6|CP012670.1_7126134_7126683_+,NA|256aa|down_7|CP012670.1_7126747_7127515_-,NA|277aa|down_8|CP012670.1_7128146_7128977_+	NA|493aa|up_9|CP012670.1_7108152_7109631_+	PRK00093, PRK00093, GTP-binding protein Der; Reviewed	NA|357aa|up_8|CP012670.1_7109836_7110907_+	cd01166, KdgK, 2-keto-3-deoxygluconate kinase (KdgK) phosphorylates 2-keto-3-deoxygluconate (KDG) to form 2-keto-3-deoxy-6-phosphogluconate (KDGP)	NA|409aa|up_7|CP012670.1_7111035_7112262_-	COG0003, ArsA, Predicted ATPase involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|174aa|up_6|CP012670.1_7112379_7112901_-	NA	NA|242aa|up_5|CP012670.1_7112972_7113698_+	COG4464, CapC, Capsular polysaccharide biosynthesis protein [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane]	NA|474aa|up_4|CP012670.1_7113841_7115263_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|389aa|up_3|CP012670.1_7115612_7116779_+	PRK09354, recA, recombinase A; Provisional	NA|168aa|up_2|CP012670.1_7116799_7117303_+	COG2062, SixA, Phosphohistidine phosphatase SixA [Signal transduction mechanisms]	NA|286aa|up_1|CP012670.1_7117591_7118449_+	TIGR00592, DNA_polymerase_alpha_catalytic_subunit, DNA polymerase (pol2)	NA|325aa|up_0|CP012670.1_7118618_7119593_+	TIGR02692, putative_tRNA_nucleotidyltransferase, tRNA adenylyltransferase	NA|245aa|down_0|CP012670.1_7119885_7120620_-	NA	NA|494aa|down_1|CP012670.1_7120733_7122215_+	NA	NA|405aa|down_2|CP012670.1_7122341_7123556_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|156aa|down_3|CP012670.1_7123659_7124127_+	cd03017, PRX_BCP, Peroxiredoxin (PRX) family, Bacterioferritin comigratory protein (BCP) subfamily; composed of  thioredoxin-dependent thiol peroxidases, widely expressed in pathogenic bacteria, that protect cells against toxicity from reactive oxygen species by reducing and detoxifying hydroperoxides	NA|195aa|down_4|CP012670.1_7124337_7124922_+	NA	NA|233aa|down_5|CP012670.1_7124968_7125667_+	cd00405, PRAI, Phosphoribosylanthranilate isomerase (PRAI) catalyzes the fourth step of the tryptophan biosynthesis, the conversion of N-(5'- phosphoribosyl)-anthranilate (PRA) to 1-(o-carboxyphenylamino)- 1-deoxyribulose 5-phosphate (CdRP)	NA|183aa|down_6|CP012670.1_7126134_7126683_+	NA	NA|256aa|down_7|CP012670.1_7126747_7127515_-	NA	NA|277aa|down_8|CP012670.1_7128146_7128977_+	NA	NA|450aa|down_9|CP012670.1_7128918_7130268_+	pfam00665, rve, Integrase core domain
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	39	7664001-7664067	36	CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	GCCGGCTGACGTGGCCGTGGTCGT	24	1	1	7664025-7664043	CP012670.1_7664074-7664092	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA,NA|120aa|down_1|CP012670.1_7667040_7667400_+,NA|176aa|down_7|CP012670.1_7677400_7677928_-	NA|768aa|up_9|CP012670.1_7648336_7650640_-	cd05254, dTDP_HR_like_SDR_e, dTDP-6-deoxy-L-lyxo-4-hexulose reductase and related proteins, extended (e) SDRs	NA|374aa|up_8|CP012670.1_7650636_7651758_-	COG0562, Glf, UDP-galactopyranose mutase [Cell envelope biogenesis, outer membrane]	NA|380aa|up_7|CP012670.1_7651761_7652901_-	cd04950, GT4_TuaH-like, teichuronic acid biosynthesis glycosyltransferase TuaH and similar proteins	NA|627aa|up_6|CP012670.1_7653255_7655136_-	PRK10150, PRK10150, beta-D-glucuronidase; Provisional	NA|383aa|up_5|CP012670.1_7655648_7656797_+	PRK01642, cls, cardiolipin synthetase; Reviewed	NA|280aa|up_4|CP012670.1_7656960_7657800_+	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|401aa|up_3|CP012670.1_7657802_7659005_+	COG3825, COG3825, Uncharacterized protein conserved in bacteria [Function unknown]	NA|278aa|up_2|CP012670.1_7659047_7659881_-	pfam05685, Uma2, Putative restriction endonuclease	NA|255aa|up_1|CP012670.1_7660038_7660803_-	PRK11360, PRK11360, two-component system sensor histidine kinase AtoS	NA|1018aa|up_0|CP012670.1_7660854_7663908_-	TIGR03960, radical_SAM_domain_protein, radical SAM family uncharacterized protein	NA|736aa|down_0|CP012670.1_7664450_7666658_+	pfam08757, CotH, CotH kinase protein	NA|120aa|down_1|CP012670.1_7667040_7667400_+	NA	NA|432aa|down_2|CP012670.1_7667615_7668911_+	pfam03631, Virul_fac_BrkB, Virulence factor BrkB	NA|659aa|down_3|CP012670.1_7669098_7671075_+	smart00387, HATPase_c, Histidine kinase-like ATPases	NA|751aa|down_4|CP012670.1_7671071_7673324_+	smart00387, HATPase_c, Histidine kinase-like ATPases	NA|307aa|down_5|CP012670.1_7673764_7674685_+	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|656aa|down_6|CP012670.1_7675113_7677081_-	cd04843, Peptidases_S8_11, Peptidase S8 family domain, uncharacterized subfamily 11	NA|176aa|down_7|CP012670.1_7677400_7677928_-	NA	NA|466aa|down_8|CP012670.1_7677924_7679322_-	COG4938, COG4938, Uncharacterized conserved protein [Function unknown]	NA|772aa|down_9|CP012670.1_7679781_7682097_-	pfam11369, DUF3160, Protein of unknown function (DUF3160)
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	40	8147006-8147116	37	CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	CCTCCCCTGCTCTCCGTAACGGACGCTCGGCCC	33	0	0	NA	NA	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|198aa|up_6|CP012670.1_8137869_8138463_+,NA|160aa|down_1|CP012670.1_8148684_8149164_-	NA|316aa|up_9|CP012670.1_8135010_8135958_-	COG0077, PheA, Prephenate dehydratase [Amino acid transport and metabolism]	NA|200aa|up_8|CP012670.1_8136489_8137089_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|151aa|up_7|CP012670.1_8137100_8137553_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|198aa|up_6|CP012670.1_8137869_8138463_+	NA	NA|811aa|up_5|CP012670.1_8138499_8140932_-	PRK05580, PRK05580, primosome assembly protein PriA; Validated	NA|199aa|up_4|CP012670.1_8141402_8141999_+	pfam09990, DUF2231, Predicted membrane protein (DUF2231)	NA|493aa|up_3|CP012670.1_8142017_8143496_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|326aa|up_2|CP012670.1_8143691_8144669_+	cd01144, BtuF, Cobalamin binding protein BtuF	NA|89aa|up_1|CP012670.1_8144775_8145042_+	PRK00982, acpP, acyl carrier protein; Provisional	NA|602aa|up_0|CP012670.1_8145038_8146844_+	cd05931, FAAL, Fatty acyl-AMP ligase (FAAL)	NA|466aa|down_0|CP012670.1_8147225_8148623_+	COG2271, UhpC, Sugar phosphate permease [Carbohydrate transport and metabolism]	NA|160aa|down_1|CP012670.1_8148684_8149164_-	NA	NA|199aa|down_2|CP012670.1_8149277_8149874_+	COG0605, SodA, Superoxide dismutase [Inorganic ion transport and metabolism]	NA|529aa|down_3|CP012670.1_8150267_8151854_+	pfam09937, DUF2169, Uncharacterized protein conserved in bacteria (DUF2169)	NA|487aa|down_4|CP012670.1_8151883_8153344_+	TIGR03361, VI_Rhs_Vgr, type VI secretion system Vgr family protein	NA|922aa|down_5|CP012670.1_8153340_8156106_+	TIGR03361, VI_Rhs_Vgr, type VI secretion system Vgr family protein	NA|787aa|down_6|CP012670.1_8156162_8158523_+	pfam09937, DUF2169, Uncharacterized protein conserved in bacteria (DUF2169)	NA|669aa|down_7|CP012670.1_8158519_8160526_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|283aa|down_8|CP012670.1_8160587_8161436_+	pfam12059, DUF3540, Protein of unknown function (DUF3540)	NA|138aa|down_9|CP012670.1_8161447_8161861_+	pfam13665, DUF4150, Domain of unknown function (DUF4150)
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	41	8683416-8683691	10	CRT	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	CGACCGGTCCGACCGCCCTCG	21	0	0	NA	NA	NA	6	6	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|34aa|up_6|CP012670.1_8670634_8670736_-,NA|435aa|up_4|CP012670.1_8672729_8674034_-,NA|64aa|up_3|CP012670.1_8674494_8674686_+,NA|906aa|down_2|CP012670.1_8687152_8689870_-,NA|147aa|down_9|CP012670.1_8701316_8701757_+	NA|260aa|up_9|CP012670.1_8667572_8668352_-	pfam04982, HPP, HPP family	NA|236aa|up_8|CP012670.1_8668402_8669110_-	COG0569, TrkA, K+ transport systems, NAD-binding component [Inorganic ion transport and metabolism]	NA|509aa|up_7|CP012670.1_8669111_8670638_-	COG0025, NhaP, NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism]	NA|34aa|up_6|CP012670.1_8670634_8670736_-	NA	NA|613aa|up_5|CP012670.1_8670798_8672637_-	cd01115, SLC13_permease, Permease SLC13 (solute carrier 13)	NA|435aa|up_4|CP012670.1_8672729_8674034_-	NA	NA|64aa|up_3|CP012670.1_8674494_8674686_+	NA	NA|172aa|up_2|CP012670.1_8674980_8675496_+	pfam01814, Hemerythrin, Hemerythrin HHE cation binding domain	NA|1777aa|up_1|CP012670.1_8676310_8681641_-	PRK12323, PRK12323, DNA polymerase III subunit gamma/tau	NA|305aa|up_0|CP012670.1_8681889_8682804_-	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|414aa|down_0|CP012670.1_8683893_8685135_+	pfam13304, AAA_21, AAA domain, putative AbiEii toxin, Type IV TA system	NA|508aa|down_1|CP012670.1_8685229_8686753_-	PRK08204, PRK08204, hypothetical protein; Provisional	NA|906aa|down_2|CP012670.1_8687152_8689870_-	NA	NA|1101aa|down_3|CP012670.1_8689866_8693169_-	pfam14102, Caps_synth_CapC, Capsule biosynthesis CapC	NA|807aa|down_4|CP012670.1_8695718_8698139_+	COG4252, COG4252, Predicted transmembrane sensor domain [Signal transduction mechanisms]	NA|276aa|down_5|CP012670.1_8698272_8699100_+	sd00006, TPR, Tetratricopeptide repeat	NA|100aa|down_6|CP012670.1_8699048_8699348_+	pfam14076, DUF4258, Domain of unknown function (DUF4258)	NA|157aa|down_7|CP012670.1_8699344_8699815_+	TIGR03830, transcriptional_regulator_XRE_family, putative zinc finger/helix-turn-helix protein, YgiT family	NA|354aa|down_8|CP012670.1_8699943_8701005_+	cd10447, GIY-YIG_unchar_2, GIY-YIG domain of uncharacterized hypothetical protein found in bacteria and archaea	NA|147aa|down_9|CP012670.1_8701316_8701757_+	NA
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	44	9186842-9190447	40,11,10,11	CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	GCTTCAATGGGGCCGCCGCCTTTCAGCGGCGGAGAG,GCTTCAATGGGGCCGCCGCCTTTCAGCGGCGGAGAG,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC	36,36,36,36	0	0	NA	NA	NA:NA:NA:NA	49,49,46,46	49	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|89aa|up_9|CP012670.1_9170679_9170946_-,NA|87aa|up_8|CP012670.1_9170984_9171245_-,NA|134aa|up_7|CP012670.1_9171247_9171649_-,NA|95aa|down_1|CP012670.1_9192881_9193166_+,NA|333aa|down_9|CP012670.1_9205407_9206406_+	NA|89aa|up_9|CP012670.1_9170679_9170946_-	NA	NA|87aa|up_8|CP012670.1_9170984_9171245_-	NA	NA|134aa|up_7|CP012670.1_9171247_9171649_-	NA	NA|2171aa|up_6|CP012670.1_9171679_9178192_-	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional	NA|147aa|up_5|CP012670.1_9178411_9178852_-	COG1733, COG1733, Predicted transcriptional regulators [Transcription]	NA|373aa|up_4|CP012670.1_9178995_9180114_+	cd02933, OYE_like_FMN, Old yellow enzyme (OYE)-like FMN binding domain	NA|993aa|up_3|CP012670.1_9180325_9183304_+	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|280aa|up_2|CP012670.1_9183690_9184530_-	PRK03427, PRK03427, cell division protein ZipA; Provisional	NA|228aa|up_1|CP012670.1_9184480_9185164_-	PHA02518, PHA02518, ParA-like protein; Provisional	NA|393aa|up_0|CP012670.1_9185517_9186696_+	cd07041, STAS_RsbR_RsbS_like, Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation	NA|319aa|down_0|CP012670.1_9191724_9192681_+	PRK05687, fliH, flagellar assembly protein FliH	NA|95aa|down_1|CP012670.1_9192881_9193166_+	NA	NA|1378aa|down_2|CP012670.1_9193916_9198050_+	PRK12323, PRK12323, DNA polymerase III subunit gamma/tau	NA|405aa|down_3|CP012670.1_9198046_9199261_+	PRK13903, murB, UDP-N-acetylmuramate dehydrogenase	NA|360aa|down_4|CP012670.1_9199493_9200573_+	COG0077, PheA, Prephenate dehydratase [Amino acid transport and metabolism]	NA|298aa|down_5|CP012670.1_9200619_9201513_+	pfam00150, Cellulase, Cellulase (glycosyl hydrolase family 5)	NA|483aa|down_6|CP012670.1_9201830_9203279_+	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|200aa|down_7|CP012670.1_9203387_9203987_+	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|349aa|down_8|CP012670.1_9203983_9205030_+	TIGR02795, Uncharacterized_protein_in_oprL_3'region, tol-pal system protein YbgF	NA|333aa|down_9|CP012670.1_9205407_9206406_+	NA
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	45	9834700-9834796	41	CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	GGACGATCCGGCGCGCGGGGGGAACGAC	28	0	0	NA	NA	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|141aa|up_0|CP012670.1_9832490_9832913_-,NA|166aa|down_1|CP012670.1_9835859_9836357_+	NA|193aa|up_9|CP012670.1_9822917_9823496_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|91aa|up_8|CP012670.1_9823671_9823944_+	cd10153, RcnR-FrmR-like_DUF156, Transcriptional regulators RcnR and FrmR, and related domains; this domain family was previously known as part of DUF156	NA|335aa|up_7|CP012670.1_9823952_9824957_+	COG1230, CzcD, Co/Zn/Cd efflux system component [Inorganic ion transport and metabolism]	NA|190aa|up_6|CP012670.1_9824953_9825523_+	COG1971, COG1971, Predicted membrane protein [Function unknown]	NA|280aa|up_5|CP012670.1_9826011_9826851_-	COG2226, UbiE, Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism]	NA|124aa|up_4|CP012670.1_9827136_9827508_+	pfam13473, Cupredoxin_1, Cupredoxin-like domain	NA|312aa|up_3|CP012670.1_9827602_9828538_+	pfam05023, Phytochelatin, Phytochelatin synthase	NA|791aa|up_2|CP012670.1_9828659_9831032_-	cd02094, P-type_ATPase_Cu-like, P-type heavy metal-transporting ATPase, similar to human copper-transporting ATPases, ATP7A and ATP7B	NA|468aa|up_1|CP012670.1_9831094_9832498_-	COG2132, SufI, Putative multicopper oxidases [Secondary metabolites biosynthesis, transport, and catabolism]	NA|141aa|up_0|CP012670.1_9832490_9832913_-	NA	NA|358aa|down_0|CP012670.1_9834910_9835984_+	pfam03411, Peptidase_M74, Penicillin-insensitive murein endopeptidase	NA|166aa|down_1|CP012670.1_9835859_9836357_+	NA	NA|270aa|down_2|CP012670.1_9836290_9837100_-	cd09086, ExoIII-like_AP-endo, Escherichia coli exonuclease III (ExoIII) and Neisseria meningitides NExo-like subfamily of the ExoIII family purinic/apyrimidinic (AP) endonucleases	NA|293aa|down_3|CP012670.1_9837178_9838057_-	pfam00561, Abhydrolase_1, alpha/beta hydrolase fold	NA|240aa|down_4|CP012670.1_9838112_9838832_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|221aa|down_5|CP012670.1_9838860_9839523_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|136aa|down_6|CP012670.1_9839845_9840253_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|1015aa|down_7|CP012670.1_9840376_9843421_+	COG1554, ATH1, Trehalose and maltose hydrolases (possible phosphorylases) [Carbohydrate transport and metabolism]	NA|325aa|down_8|CP012670.1_9843414_9844389_+	pfam00582, Usp, Universal stress protein family	NA|787aa|down_9|CP012670.1_9844467_9846828_+	pfam13654, AAA_32, AAA domain
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	46	10224378-10224496	42	CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	AATGCCCTCAGACCCCCCGCCCCCGCTGGGCC	32	0	0	NA	NA	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|154aa|up_6|CP012670.1_10213026_10213488_+,NA|50aa|down_1|CP012670.1_10230310_10230460_+,NA|104aa|down_3|CP012670.1_10230883_10231195_+,NA|152aa|down_6|CP012670.1_10233779_10234235_+	NA|257aa|up_9|CP012670.1_10209532_10210303_+	pfam04343, DUF488, Protein of unknown function, DUF488	NA|223aa|up_8|CP012670.1_10210563_10211232_+	PRK00107, gidB, 16S rRNA (guanine(527)-N(7))-methyltransferase RsmG	NA|430aa|up_7|CP012670.1_10211463_10212753_+	PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional	NA|154aa|up_6|CP012670.1_10213026_10213488_+	NA	NA|310aa|up_5|CP012670.1_10213594_10214524_-	PRK06522, PRK06522, 2-dehydropantoate 2-reductase; Reviewed	NA|617aa|up_4|CP012670.1_10214619_10216470_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|847aa|up_3|CP012670.1_10217052_10219593_+	PRK14989, PRK14989, nitrite reductase subunit NirD; Provisional	NA|133aa|up_2|CP012670.1_10219627_10220026_+	pfam13806, Rieske_2, Rieske-like [2Fe-2S] domain	NA|577aa|up_1|CP012670.1_10220141_10221872_+	cd16371, DMSOR_beta_like, uncharacterized subfamily of DMSO Reductase beta subunit family	NA|779aa|up_0|CP012670.1_10221871_10224208_+	cd02754, MopB_Nitrate-R-NapA-like, Nitrate reductases, NapA (Nitrate-R-NapA), NasA, and NarB catalyze the reduction of nitrate to nitrite	NA|431aa|down_0|CP012670.1_10228521_10229814_-	cd07041, STAS_RsbR_RsbS_like, Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation	NA|50aa|down_1|CP012670.1_10230310_10230460_+	NA	NA|58aa|down_2|CP012670.1_10230623_10230797_-	PRK15313, PRK15313, intestinal colonization autotransporter adhesin MisL	NA|104aa|down_3|CP012670.1_10230883_10231195_+	NA	NA|367aa|down_4|CP012670.1_10231236_10232337_-	COG0119, LeuA, Isopropylmalate/homocitrate/citramalate synthases [Amino acid transport and metabolism]	NA|243aa|down_5|CP012670.1_10232666_10233395_+	pfam12850, Metallophos_2, Calcineurin-like phosphoesterase superfamily domain	NA|152aa|down_6|CP012670.1_10233779_10234235_+	NA	NA|355aa|down_7|CP012670.1_10234733_10235798_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|555aa|down_8|CP012670.1_10235794_10237459_-	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|545aa|down_9|CP012670.1_10237487_10239122_-	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	47	10489524-10489634	43	CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	CTCCCGGGAGGCTTCGGAGAGCCTCCG	27	0	0	NA	NA	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|365aa|up_9|CP012670.1_10474418_10475513_-,NA|389aa|up_4|CP012670.1_10481342_10482509_+,NA|147aa|up_2|CP012670.1_10484259_10484700_+,NA|136aa|down_3|CP012670.1_10493504_10493912_+,NA|284aa|down_7|CP012670.1_10497658_10498510_-,NA|154aa|down_9|CP012670.1_10500181_10500643_-	NA|365aa|up_9|CP012670.1_10474418_10475513_-	NA	NA|77aa|up_8|CP012670.1_10475946_10476177_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|260aa|up_7|CP012670.1_10476468_10477248_+	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|451aa|up_6|CP012670.1_10477244_10478597_+	pfam04773, FecR, FecR protein	NA|743aa|up_5|CP012670.1_10478817_10481046_+	PRK06111, PRK06111, acetyl-CoA carboxylase biotin carboxylase subunit; Validated	NA|389aa|up_4|CP012670.1_10481342_10482509_+	NA	NA|475aa|up_3|CP012670.1_10482562_10483987_+	pfam13289, SIR2_2, SIR2-like domain	NA|147aa|up_2|CP012670.1_10484259_10484700_+	NA	NA|965aa|up_1|CP012670.1_10484833_10487728_+	PRK07003, PRK07003, DNA polymerase III subunit gamma/tau	NA|542aa|up_0|CP012670.1_10487718_10489344_+	PRK07003, PRK07003, DNA polymerase III subunit gamma/tau	NA|152aa|down_0|CP012670.1_10489638_10490094_-	COG2128, COG2128, Uncharacterized conserved protein [Function unknown]	NA|574aa|down_1|CP012670.1_10490308_10492030_+	PRK08609, PRK08609, DNA polymerase/3'-5' exonuclease PolX	NA|297aa|down_2|CP012670.1_10492377_10493268_+	pfam12833, HTH_18, Helix-turn-helix domain	NA|136aa|down_3|CP012670.1_10493504_10493912_+	NA	NA|219aa|down_4|CP012670.1_10494179_10494836_-	pfam07602, DUF1565, Protein of unknown function (DUF1565)	NA|466aa|down_5|CP012670.1_10495266_10496664_+	COG0031, CysK, Cysteine synthase [Amino acid transport and metabolism]	NA|311aa|down_6|CP012670.1_10496722_10497655_+	COG1878, COG1878, Kynurenine formamidase [Amino acid transport and metabolism]	NA|284aa|down_7|CP012670.1_10497658_10498510_-	NA	NA|308aa|down_8|CP012670.1_10499230_10500154_+	COG0492, TrxB, Thioredoxin reductase [Posttranslational modification, protein turnover, chaperones]	NA|154aa|down_9|CP012670.1_10500181_10500643_-	NA
GCA_004135735.1_ASM413573v1	CP012670	Sorangium cellulosum strain So ceGT47 chromosome, complete genome	49	10702732-10702952	44	CRISPRCasFinder	no		csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	Orphan	GCAGATGCGCGTCGCCGCGGGCATTGGCGCGGCGGGGCATCGGCAGATGCGCGT	54	0	0	NA	NA	NA	1	1	Orphan	csb2gr5,cas7,cas8u1,cas3,cas1,cas2,cas10,cmr3gr5,cmr1gr7,cmr4gr7,cmr5gr11,cmr6gr7,csa3,PD-DExK,DEDDh,RT,WYL,DinG	NA|229aa|up_4|CP012670.1_10694704_10695391_+,NA|59aa|down_3|CP012670.1_10705692_10705869_-,NA|94aa|down_6|CP012670.1_10709735_10710017_-,NA|92aa|down_7|CP012670.1_10710167_10710443_-	NA|277aa|up_9|CP012670.1_10688896_10689727_+	pfam14219, DUF4328, Domain of unknown function (DUF4328)	NA|312aa|up_8|CP012670.1_10689792_10690728_-	COG0583, LysR, Transcriptional regulator [Transcription]	NA|244aa|up_7|CP012670.1_10690838_10691570_+	cd05233, SDR_c, classical (c) SDRs	NA|106aa|up_6|CP012670.1_10691559_10691877_+	pfam07045, DUF1330, Domain of unknown function (DUF1330)	NA|732aa|up_5|CP012670.1_10692206_10694402_+	pfam09826, Beta_propel, Beta propeller domain	NA|229aa|up_4|CP012670.1_10694704_10695391_+	NA	NA|477aa|up_3|CP012670.1_10695433_10696864_+	smart00020, Tryp_SPc, Trypsin-like serine protease	NA|310aa|up_2|CP012670.1_10696895_10697825_+	cd03392, PAP2_like_2, PAP2_like_2 proteins	NA|503aa|up_1|CP012670.1_10697957_10699466_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|890aa|up_0|CP012670.1_10699879_10702549_+	pfam09937, DUF2169, Uncharacterized protein conserved in bacteria (DUF2169)	NA|322aa|down_0|CP012670.1_10703155_10704121_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|134aa|down_1|CP012670.1_10704429_10704831_-	pfam13665, DUF4150, Domain of unknown function (DUF4150)	NA|214aa|down_2|CP012670.1_10704857_10705499_-	pfam12059, DUF3540, Protein of unknown function (DUF3540)	NA|59aa|down_3|CP012670.1_10705692_10705869_-	NA	NA|355aa|down_4|CP012670.1_10705945_10707010_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|887aa|down_5|CP012670.1_10707006_10709667_-	pfam09937, DUF2169, Uncharacterized protein conserved in bacteria (DUF2169)	NA|94aa|down_6|CP012670.1_10709735_10710017_-	NA	NA|92aa|down_7|CP012670.1_10710167_10710443_-	NA	NA|791aa|down_8|CP012670.1_10710501_10712874_-	TIGR03361, VI_Rhs_Vgr, type VI secretion system Vgr family protein	NA|1309aa|down_9|CP012670.1_10713369_10717296_+	TIGR02148, ORFveg106_random, fibro-slime domain
