assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	2	253478-253646	2	PILER-CR	no		cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Orphan	TCGGCTCCAGGGGGTTTTCCGACCACC	27	0	0	NA	NA	NA	2	2	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	NA|183aa|up_8|CP012672.1_243745_244294_-,NA|73aa|up_4|CP012672.1_247649_247868_+,NA|118aa|up_2|CP012672.1_249528_249882_+,NA|992aa|up_0|CP012672.1_250494_253470_+,NA|130aa|down_2|CP012672.1_257050_257440_-,NA|114aa|down_3|CP012672.1_258508_258850_+,NA|114aa|down_7|CP012672.1_261722_262064_-,NA|51aa|down_8|CP012672.1_262479_262632_+,NA|65aa|down_9|CP012672.1_262628_262823_+	NA|582aa|up_9|CP012672.1_241949_243695_-	cd00796, INT_Rci_Hp1_C, Shufflon-specific DNA recombinase Rci and Bacteriophage Hp1_like integrase, C-terminal catalytic domain	NA|183aa|up_8|CP012672.1_243745_244294_-	NA	NA|181aa|up_7|CP012672.1_244672_245215_-	pfam00436, SSB, Single-strand binding protein family	NA|421aa|up_6|CP012672.1_245211_246474_-	pfam12705, PDDEXK_1, PD-(D/E)XK nuclease superfamily	NA|269aa|up_5|CP012672.1_246470_247277_-	pfam04404, ERF, ERF superfamily	NA|73aa|up_4|CP012672.1_247649_247868_+	NA	NA|556aa|up_3|CP012672.1_247864_249532_+	COG1061, SSL2, DNA or RNA helicases of superfamily II [Transcription / DNA replication, recombination, and repair]	NA|118aa|up_2|CP012672.1_249528_249882_+	NA	NA|128aa|up_1|CP012672.1_249871_250255_+	smart00990, VRR_NUC, This model contains proteins with the VRR-NUC domain	NA|992aa|up_0|CP012672.1_250494_253470_+	NA	NA|433aa|down_0|CP012672.1_253958_255257_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|406aa|down_1|CP012672.1_255476_256694_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|130aa|down_2|CP012672.1_257050_257440_-	NA	NA|114aa|down_3|CP012672.1_258508_258850_+	NA	NA|70aa|down_4|CP012672.1_258846_259056_+	pfam05717, TnpB_IS66, IS66 Orf2 like protein	NA|532aa|down_5|CP012672.1_259632_261228_-	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|124aa|down_6|CP012672.1_261354_261726_-	pfam05717, TnpB_IS66, IS66 Orf2 like protein	NA|114aa|down_7|CP012672.1_261722_262064_-	NA	NA|51aa|down_8|CP012672.1_262479_262632_+	NA	NA|65aa|down_9|CP012672.1_262628_262823_+	NA
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	3	489919-490090	1	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Orphan	GCCCGCAACCCGGGTCGGAGACCTCAACCCCGGTCGGGCGGC	42	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	NA,NA|428aa|down_5|CP012672.1_496052_497336_+	NA|353aa|up_9|CP012672.1_477591_478650_-	pfam00348, polyprenyl_synt, Polyprenyl synthetase	NA|459aa|up_8|CP012672.1_479661_481038_+	pfam00144, Beta-lactamase, Beta-lactamase	NA|464aa|up_7|CP012672.1_481263_482655_+	pfam00144, Beta-lactamase, Beta-lactamase	NA|219aa|up_6|CP012672.1_482704_483361_-	pfam04264, YceI, YceI-like domain	NA|267aa|up_5|CP012672.1_483382_484183_-	cd07363, 45_DOPA_Dioxygenase, The Class III extradiol dioxygenase, 4,5-DOPA Dioxygenase, catalyzes the incorporation of both atoms of molecular oxygen into 4,5-dihydroxy-phenylalanine	NA|320aa|up_4|CP012672.1_484372_485332_+	cd08422, PBP2_CrgA_like, The C-terminal substrate binding domain of LysR-type transcriptional regulator CrgA and its related homologs, contains the type 2 periplasmic binding domain	NA|349aa|up_3|CP012672.1_485796_486843_+	cd13558, PBP2_SsuA_like_2, Putative substrate binding domain of sulfonate binding protein, the type 2 periplasmic binding protein fold	NA|306aa|up_2|CP012672.1_486839_487757_+	COG0600, TauC, ABC-type nitrate/sulfonate/bicarbonate transport system, permease component [Inorganic ion transport and metabolism]	NA|263aa|up_1|CP012672.1_487732_488521_+	COG1116, TauB, ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component [Inorganic ion transport and metabolism]	NA|438aa|up_0|CP012672.1_488559_489873_+	TIGR03860, FMN_nitrolo, FMN-dependent oxidoreductase, nitrilotriacetate monooxygenase family	NA|543aa|down_0|CP012672.1_490449_492078_+	pfam05576, Peptidase_S37, PS-10 peptidase S37	NA|275aa|down_1|CP012672.1_492130_492955_-	pfam12833, HTH_18, Helix-turn-helix domain	NA|225aa|down_2|CP012672.1_493020_493695_+	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|441aa|down_3|CP012672.1_493857_495180_+	cd07561, Peptidase_S41_CPP_like, C-terminal processing peptidase-like; serine protease family S41	NA|213aa|down_4|CP012672.1_495376_496015_+	cd03016, PRX_1cys, Peroxiredoxin (PRX) family, 1-cys PRX subfamily; composed of PRXs containing only one conserved cysteine, which serves as the peroxidatic cysteine	NA|428aa|down_5|CP012672.1_496052_497336_+	NA	NA|295aa|down_6|CP012672.1_497416_498301_-	COG0583, LysR, Transcriptional regulator [Transcription]	NA|495aa|down_7|CP012672.1_498960_500445_+	COG3405, CelA, Endoglucanase Y [Carbohydrate transport and metabolism]	NA|378aa|down_8|CP012672.1_500479_501613_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|379aa|down_9|CP012672.1_501728_502865_+	cd03807, GT4_WbnK-like, Shigella dysenteriae WbnK and similar proteins
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	4	676547-677167	2,1,3	CRISPRCasFinder,CRT,PILER-CR	no	cas8u1,cas3,csb2gr5,csb1gr7	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Unclear	CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC	36,36,36	0	0	NA	NA	NA:NA:NA	8,8,7	8	Unclear	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	cas8u1|313aa|up_5|CP012672.1_666447_667386_-,NA|271aa|up_0|CP012672.1_675022_675835_-,NA|72aa|down_0|CP012672.1_677209_677425_+,NA|372aa|down_5|CP012672.1_681772_682888_+	NA|331aa|up_9|CP012672.1_661710_662703_-	pfam13578, Methyltransf_24, Methyltransferase domain	NA|359aa|up_8|CP012672.1_662699_663776_-	COG0535, COG0535, Predicted Fe-S oxidoreductases [General function prediction only]	NA|249aa|up_7|CP012672.1_663772_664519_-	COG0535, COG0535, Predicted Fe-S oxidoreductases [General function prediction only]	NA|154aa|up_6|CP012672.1_665833_666295_+	pfam10137, TIR-like, Predicted nucleotide-binding protein containing TIR-like domain	cas8u1|313aa|up_5|CP012672.1_666447_667386_-	NA	cas3|1000aa|up_4|CP012672.1_667382_670382_-	TIGR02621, CRISPR-associated_helicase_Cas3, CRISPR-associated helicase Cas3, subtype Dpsyc	csb2gr5|564aa|up_3|CP012672.1_670374_672066_-	TIGR02165, CRISPR-associated_protein_GSU0054_family, CRISPR-associated protein GSU0054/csb2, Dpsyc system	csb1gr7|425aa|up_2|CP012672.1_672065_673340_-	pfam09617, Cas_GSU0053, CRISPR-associated protein GSU0053 (Cas_GSU0053)	NA|313aa|up_1|CP012672.1_673769_674708_-	PRK05687, fliH, flagellar assembly protein FliH	NA|271aa|up_0|CP012672.1_675022_675835_-	NA	NA|72aa|down_0|CP012672.1_677209_677425_+	NA	NA|86aa|down_1|CP012672.1_677675_677933_+	pfam03683, UPF0175, Uncharacterized protein family (UPF0175)	NA|104aa|down_2|CP012672.1_678106_678418_+	COG2405, COG2405, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|541aa|down_3|CP012672.1_678821_680444_+	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|363aa|down_4|CP012672.1_680539_681628_+	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|372aa|down_5|CP012672.1_681772_682888_+	NA	NA|520aa|down_6|CP012672.1_682928_684488_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|1643aa|down_7|CP012672.1_685368_690297_+	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|230aa|down_8|CP012672.1_690403_691093_-	COG1285, SapB, Uncharacterized membrane protein [Function unknown]	NA|347aa|down_9|CP012672.1_691241_692282_-	pfam01551, Peptidase_M23, Peptidase family M23
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	5	874593-874693	3	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Orphan	CGATGCACGGGCAGGCGACGATGCACGGCCAGCCG	35	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	NA|50aa|up_1|CP012672.1_869117_869267_+,NA|295aa|down_9|CP012672.1_888308_889193_-	NA|57aa|up_9|CP012672.1_860626_860797_+	PRK00504, rpmG, 50S ribosomal protein L33; Validated	NA|209aa|up_8|CP012672.1_860986_861613_+	pfam00584, SecE, SecE/Sec61-gamma subunits of protein translocation complex	NA|177aa|up_7|CP012672.1_861643_862174_+	PRK05609, nusG, transcription antitermination protein NusG; Validated	NA|148aa|up_6|CP012672.1_862291_862735_+	PRK00140, rplK, 50S ribosomal protein L11; Validated	NA|237aa|up_5|CP012672.1_862880_863591_+	PRK05424, rplA, 50S ribosomal protein L1; Validated	NA|177aa|up_4|CP012672.1_863594_864125_+	PRK00099, rplJ, 50S ribosomal protein L10; Reviewed	NA|131aa|up_3|CP012672.1_864222_864615_+	PRK00157, rplL, 50S ribosomal protein L7/L12; Reviewed	NA|1379aa|up_2|CP012672.1_864981_869118_+	PRK00405, rpoB, DNA-directed RNA polymerase subunit beta; Reviewed	NA|50aa|up_1|CP012672.1_869117_869267_+	NA	NA|1430aa|up_0|CP012672.1_869259_873549_+	PRK00566, PRK00566, DNA-directed RNA polymerase subunit beta'; Provisional	NA|210aa|down_0|CP012672.1_876313_876943_+	PRK10809, PRK10809, 30S ribosomal protein S5 alanine N-acetyltransferase	NA|315aa|down_1|CP012672.1_877136_878081_+	pfam02548, Pantoate_transf, Ketopantoate hydroxymethyltransferase	NA|699aa|down_2|CP012672.1_878163_880260_-	pfam13519, VWA_2, von Willebrand factor type A domain	NA|315aa|down_3|CP012672.1_880256_881201_-	COG1721, COG1721, Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [General function prediction only]	NA|620aa|down_4|CP012672.1_881475_883335_+	TIGR01241, ATP-dependent_zinc_metalloprotease_FtsH, ATP-dependent metalloprotease FtsH	NA|480aa|down_5|CP012672.1_883367_884807_-	NF012200, choice_anch_D, choice-of-anchor D domain-containing protein	NA|416aa|down_6|CP012672.1_884854_886102_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|327aa|down_7|CP012672.1_886098_887079_-	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan	NA|333aa|down_8|CP012672.1_887195_888194_+	cd07185, OmpA_C-like, Peptidoglycan binding domains similar to the C-terminal domain of outer-membrane protein OmpA	NA|295aa|down_9|CP012672.1_888308_889193_-	NA
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	7	1780652-1780766	5	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Orphan	GGCCAGGCGTGAAGGGGGTCGAACCCCC	28	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	NA|322aa|up_9|CP012672.1_1764714_1765680_-,NA|126aa|up_4|CP012672.1_1772170_1772548_+,NA|356aa|down_1|CP012672.1_1782202_1783270_+,NA|271aa|down_2|CP012672.1_1783480_1784293_+,NA|67aa|down_3|CP012672.1_1784912_1785113_-	NA|322aa|up_9|CP012672.1_1764714_1765680_-	NA	NA|842aa|up_8|CP012672.1_1765754_1768280_-	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|337aa|up_7|CP012672.1_1768276_1769287_-	COG0451, WcaG, Nucleoside-diphosphate-sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|363aa|up_6|CP012672.1_1769283_1770372_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|414aa|up_5|CP012672.1_1770646_1771888_+	TIGR04063, Glycosyl_transferase_group_1, PEP-CTERM/exosortase A-associated glycosyltransferase, Daro_2409 family	NA|126aa|up_4|CP012672.1_1772170_1772548_+	NA	NA|401aa|up_3|CP012672.1_1772747_1773950_-	cd02253, DmpA, L-Aminopeptidase D-amidase/D-esterase (DmpA) family; DmpA catalyzes the release of N-terminal D and L amino acids from peptide susbtrates	NA|834aa|up_2|CP012672.1_1773946_1776448_-	PRK01372, ddl, D-alanine--D-alanine ligase; Reviewed	NA|823aa|up_1|CP012672.1_1776626_1779095_+	TIGR01073, ATP-dependent_DNA_helicase_PcrA, ATP-dependent DNA helicase PcrA	NA|358aa|up_0|CP012672.1_1779125_1780199_-	pfam13517, VCBS, Repeat domain in Vibrio, Colwellia, Bradyrhizobium and Shewanella	NA|326aa|down_0|CP012672.1_1781015_1781993_+	pfam08308, PEGA, PEGA domain	NA|356aa|down_1|CP012672.1_1782202_1783270_+	NA	NA|271aa|down_2|CP012672.1_1783480_1784293_+	NA	NA|67aa|down_3|CP012672.1_1784912_1785113_-	NA	NA|529aa|down_4|CP012672.1_1785165_1786752_-	cd17919, DEXHc_Snf, DEXH/Q-box helicase domain of DEAD-like helicase Snf family proteins	NA|1058aa|down_5|CP012672.1_1787139_1790313_+	cd14955, NHL_like_4, Uncharacterized NHL-repeat domain in bacterial and archaeal proteins	NA|361aa|down_6|CP012672.1_1790403_1791486_-	PHA00370, III, attachment protein	NA|813aa|down_7|CP012672.1_1792095_1794534_+	smart00637, CBD_II, CBD_II domain	NA|387aa|down_8|CP012672.1_1794654_1795815_+	COG5184, ATS1, Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton]	NA|324aa|down_9|CP012672.1_1795892_1796864_-	cd12169, PGDH_like_1, Putative D-3-Phosphoglycerate Dehydrogenases
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	12	3692647-3692765	10	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Orphan	CGGGGACCCCGCCGAACTCGCCTGAGACTTTT	32	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	NA|257aa|up_7|CP012672.1_3675321_3676092_-,NA|94aa|up_6|CP012672.1_3676117_3676399_-,NA|89aa|up_2|CP012672.1_3681772_3682039_-,NA|157aa|down_7|CP012672.1_3701353_3701824_-	NA|70aa|up_9|CP012672.1_3672411_3672621_+	PRK11618, PRK11618, inner membrane ABC transporter permease protein YjfF; Provisional	NA|793aa|up_8|CP012672.1_3672740_3675119_+	pfam07944, Glyco_hydro_127, Beta-L-arabinofuranosidase, GH127	NA|257aa|up_7|CP012672.1_3675321_3676092_-	NA	NA|94aa|up_6|CP012672.1_3676117_3676399_-	NA	NA|466aa|up_5|CP012672.1_3676826_3678224_+	pfam13699, DUF4157, Domain of unknown function (DUF4157)	NA|608aa|up_4|CP012672.1_3678254_3680078_-	pfam05960, DUF885, Bacterial protein of unknown function (DUF885)	NA|405aa|up_3|CP012672.1_3680398_3681613_+	cd07385, MPP_YkuE_C, Bacillus subtilis YkuE and related proteins, C-terminal metallophosphatase domain	NA|89aa|up_2|CP012672.1_3681772_3682039_-	NA	NA|530aa|up_1|CP012672.1_3682202_3683792_+	pfam13751, DDE_Tnp_1_6, Transposase DDE domain	NA|798aa|up_0|CP012672.1_3684112_3686506_+	pfam00759, Glyco_hydro_9, Glycosyl hydrolase family 9	NA|581aa|down_0|CP012672.1_3692920_3694663_-	pfam13231, PMT_2, Dolichyl-phosphate-mannose-protein mannosyltransferase	NA|278aa|down_1|CP012672.1_3695164_3695998_+	pfam06724, DUF1206, Domain of Unknown Function (DUF1206)	NA|260aa|down_2|CP012672.1_3696067_3696847_+	cd07729, AHL_lactonase_MBL-fold, quorum-quenching N-acyl-homoserine lactonase, MBL-fold metallo-hydrolase domain	NA|116aa|down_3|CP012672.1_3696950_3697298_+	pfam13453, zf-TFIIB, Transcription factor zinc-finger	NA|280aa|down_4|CP012672.1_3697343_3698183_-	pfam13230, GATase_4, Glutamine amidotransferases class-II	NA|462aa|down_5|CP012672.1_3698175_3699561_-	COG2308, COG2308, Uncharacterized conserved protein [Function unknown]	NA|502aa|down_6|CP012672.1_3699568_3701074_-	COG2170, COG2170, Uncharacterized conserved protein [Function unknown]	NA|157aa|down_7|CP012672.1_3701353_3701824_-	NA	NA|279aa|down_8|CP012672.1_3702051_3702888_-	pfam02099, Josephin, Josephin	NA|414aa|down_9|CP012672.1_3703074_3704316_+	pfam13679, Methyltransf_32, Methyltransferase domain
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	14	4289244-4289611	2	CRT	no		cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Orphan	GCANCGNGCGGTCGGACCGCCCNC	24	0	0	NA	NA	NA	8	8	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	NA|94aa|up_3|CP012672.1_4285488_4285770_-,NA|194aa|down_0|CP012672.1_4289799_4290381_-,NA|405aa|down_2|CP012672.1_4291705_4292920_-,NA|247aa|down_6|CP012672.1_4297153_4297894_+,NA|255aa|down_8|CP012672.1_4299049_4299814_+,NA|138aa|down_9|CP012672.1_4299968_4300382_+	NA|496aa|up_9|CP012672.1_4276184_4277672_-	COG2206, COG2206, c-di-GMP phosphodiesterase class II (HD-GYP domain) [Signal transduction mechanisms]	NA|766aa|up_8|CP012672.1_4277911_4280209_-	COG2265, TrmA, SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase [Translation, ribosomal structure and biogenesis]	NA|534aa|up_7|CP012672.1_4280615_4282217_+	PRK10819, PRK10819, transport protein TonB; Provisional	NA|113aa|up_6|CP012672.1_4282227_4282566_-	pfam01491, Frataxin_Cyay, Frataxin-like domain	NA|483aa|up_5|CP012672.1_4282623_4284072_-	cd07146, ALDH_PhpJ, Streptomyces putative phosphonoformaldehyde dehydrogenase PhpJ-like	NA|426aa|up_4|CP012672.1_4284104_4285382_-	TIGR02335, phosphonoacetate_hydrolase, phosphonoacetate hydrolase	NA|94aa|up_3|CP012672.1_4285488_4285770_-	NA	NA|247aa|up_2|CP012672.1_4286033_4286774_-	cd00475, Cis_IPPS, Cis (Z)-Isoprenyl Diphosphate Synthases	NA|457aa|up_1|CP012672.1_4287059_4288430_-	PRK11100, PRK11100, sensory histidine kinase CreC; Provisional	NA|233aa|up_0|CP012672.1_4288433_4289132_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|194aa|down_0|CP012672.1_4289799_4290381_-	NA	NA|439aa|down_1|CP012672.1_4290392_4291709_-	pfam12810, Gly_rich, Glycine rich protein	NA|405aa|down_2|CP012672.1_4291705_4292920_-	NA	NA|289aa|down_3|CP012672.1_4293074_4293941_-	sd00006, TPR, Tetratricopeptide repeat	NA|480aa|down_4|CP012672.1_4294205_4295645_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|431aa|down_5|CP012672.1_4295649_4296942_+	pfam03781, FGE-sulfatase, Sulfatase-modifying factor enzyme 1	NA|247aa|down_6|CP012672.1_4297153_4297894_+	NA	NA|390aa|down_7|CP012672.1_4297890_4299060_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|255aa|down_8|CP012672.1_4299049_4299814_+	NA	NA|138aa|down_9|CP012672.1_4299968_4300382_+	NA
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	16	4412445-4412672	13	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Orphan	CGCGCGGACCCGACCGTGGTTGCG	24	0	0	NA	NA	NA	3	3	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	NA|243aa|up_5|CP012672.1_4403613_4404342_+,NA|89aa|up_2|CP012672.1_4406262_4406529_+,NA|590aa|up_1|CP012672.1_4406611_4408381_-,NA|291aa|down_3|CP012672.1_4415671_4416544_-	NA|517aa|up_9|CP012672.1_4392238_4393789_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|271aa|up_8|CP012672.1_4394313_4395126_+	pfam08308, PEGA, PEGA domain	NA|1603aa|up_7|CP012672.1_4395435_4400244_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|391aa|up_6|CP012672.1_4401587_4402760_-	COG3829, RocR, Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains [Transcription / Signal transduction mechanisms]	NA|243aa|up_5|CP012672.1_4403613_4404342_+	NA	NA|325aa|up_4|CP012672.1_4404476_4405451_-	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|148aa|up_3|CP012672.1_4405518_4405962_-	pfam09537, DUF2383, Domain of unknown function (DUF2383)	NA|89aa|up_2|CP012672.1_4406262_4406529_+	NA	NA|590aa|up_1|CP012672.1_4406611_4408381_-	NA	NA|1281aa|up_0|CP012672.1_4408523_4412366_-	COG1197, Mfd, Transcription-repair coupling factor (superfamily II helicase) [DNA replication, recombination, and repair / Transcription]	NA|308aa|down_0|CP012672.1_4413094_4414018_-	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|268aa|down_1|CP012672.1_4414181_4414985_+	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|175aa|down_2|CP012672.1_4414858_4415383_-	pfam03055, RPE65, Retinal pigment epithelial membrane protein	NA|291aa|down_3|CP012672.1_4415671_4416544_-	NA	NA|195aa|down_4|CP012672.1_4416540_4417125_-	cd14958, NHL_PAL_like, Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL, EC 4	NA|177aa|down_5|CP012672.1_4417136_4417667_-	cd14958, NHL_PAL_like, Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL, EC 4	NA|209aa|down_6|CP012672.1_4417688_4418315_+	pfam13305, WHG, WHG domain	NA|1339aa|down_7|CP012672.1_4418618_4422635_+	PRK09510, tolA, cell envelope integrity inner membrane protein TolA; Provisional	NA|454aa|down_8|CP012672.1_4422949_4424311_-	pfam13715, CarbopepD_reg_2, CarboxypepD_reg-like domain	NA|366aa|down_9|CP012672.1_4424543_4425641_-	PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	19	5786894-5787007	16	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Orphan	CCGCATACAGATCTGTATGCTGAGCTAC	28	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	NA|128aa|up_7|CP012672.1_5778290_5778674_-,NA|78aa|up_2|CP012672.1_5784834_5785068_-,NA|144aa|up_0|CP012672.1_5786421_5786853_+,NA|273aa|down_3|CP012672.1_5790717_5791536_-,NA|362aa|down_5|CP012672.1_5792933_5794019_+	NA|523aa|up_9|CP012672.1_5774036_5775605_-	PRK05022, PRK05022, nitric oxide reductase transcriptional regulator NorR	NA|773aa|up_8|CP012672.1_5775810_5778129_+	COG3256, NorB, Nitric oxide reductase large subunit [Inorganic ion transport and metabolism]	NA|128aa|up_7|CP012672.1_5778290_5778674_-	NA	NA|970aa|up_6|CP012672.1_5778766_5781676_-	COG4770, COG4770, Acetyl/propionyl-CoA carboxylase, alpha subunit [Lipid metabolism]	NA|578aa|up_5|CP012672.1_5781678_5783412_-	COG4799, COG4799, Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) [Lipid metabolism]	NA|137aa|up_4|CP012672.1_5783621_5784032_-	cd04584, CBS_pair_AcuB_like, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the ACT domain	NA|139aa|up_3|CP012672.1_5784421_5784838_-	pfam01844, HNH, HNH endonuclease	NA|78aa|up_2|CP012672.1_5784834_5785068_-	NA	NA|139aa|up_1|CP012672.1_5785310_5785727_-	TIGR03300, assembly_YfgL, outer membrane assembly lipoprotein YfgL	NA|144aa|up_0|CP012672.1_5786421_5786853_+	NA	NA|262aa|down_0|CP012672.1_5787107_5787893_-	cd04586, CBS_pair_BON_assoc, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the BON (bacterial OsmY and nodulation domain) domain	NA|225aa|down_1|CP012672.1_5788269_5788944_+	pfam00582, Usp, Universal stress protein family	NA|453aa|down_2|CP012672.1_5789161_5790520_+	pfam03629, SASA, Carbohydrate esterase, sialic acid-specific acetylesterase	NA|273aa|down_3|CP012672.1_5790717_5791536_-	NA	NA|226aa|down_4|CP012672.1_5791567_5792245_-	PTZ00146, PTZ00146, fibrillarin; Provisional	NA|362aa|down_5|CP012672.1_5792933_5794019_+	NA	NA|259aa|down_6|CP012672.1_5794057_5794834_+	NF012181, MSCRAMM_SdrD, MSCRAMM family adhesin SdrD	NA|279aa|down_7|CP012672.1_5794985_5795822_+	pfam02311, AraC_binding, AraC-like ligand binding domain	NA|207aa|down_8|CP012672.1_5796296_5796917_+	cd02175, GH16_lichenase, lichenase, member of glycosyl hydrolase family 16	NA|918aa|down_9|CP012672.1_5797130_5799884_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	23	6793013-6793378	4	CRT	no		cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Orphan	CCTCAACCTGGGTCGGGT	18	1	4	6793148-6793169|6793148-6793169|6793148-6793169|6793148-6793169	CP012672.1_5283827-5283806|CP012672.1_3204433-3204412|CP012672.1_6308340-6308361|CP012672.1_11564550-11564571	NA	7	7	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	NA|218aa|up_8|CP012672.1_6782905_6783559_-,NA|150aa|up_6|CP012672.1_6785652_6786102_-,NA|189aa|up_2|CP012672.1_6789078_6789645_+,NA|146aa|down_0|CP012672.1_6793498_6793936_+,NA|253aa|down_2|CP012672.1_6795220_6795979_-,NA|242aa|down_3|CP012672.1_6796673_6797399_+,NA|186aa|down_6|CP012672.1_6799045_6799603_+,NA|125aa|down_7|CP012672.1_6799655_6800030_+,NA|198aa|down_8|CP012672.1_6800498_6801092_-,NA|172aa|down_9|CP012672.1_6801091_6801607_-	NA|1643aa|up_9|CP012672.1_6771143_6776072_-	PRK09751, PRK09751, putative ATP-dependent helicase Lhr; Provisional	NA|218aa|up_8|CP012672.1_6782905_6783559_-	NA	NA|510aa|up_7|CP012672.1_6784085_6785615_+	pfam16683, TGase_elicitor, Transglutaminase elicitor	NA|150aa|up_6|CP012672.1_6785652_6786102_-	NA	NA|229aa|up_5|CP012672.1_6786198_6786885_+	pfam07081, DUF1349, Protein of unknown function (DUF1349)	NA|401aa|up_4|CP012672.1_6787229_6788432_+	cd00657, Ferritin_like, Ferritin-like superfamily of diiron-containing four-helix-bundle proteins	NA|104aa|up_3|CP012672.1_6788540_6788852_+	pfam12823, DUF3817, Domain of unknown function (DUF3817)	NA|189aa|up_2|CP012672.1_6789078_6789645_+	NA	NA|267aa|up_1|CP012672.1_6789910_6790711_+	pfam13527, Acetyltransf_9, Acetyltransferase (GNAT) domain	NA|612aa|up_0|CP012672.1_6790751_6792587_-	pfam01401, Peptidase_M2, Angiotensin-converting enzyme	NA|146aa|down_0|CP012672.1_6793498_6793936_+	NA	NA|128aa|down_1|CP012672.1_6794496_6794880_+	cd08351, ChaP_like, ChaP, an enzyme involved in the biosynthesis of the antitumor agent chartreusin (cha), and similar proteins	NA|253aa|down_2|CP012672.1_6795220_6795979_-	NA	NA|242aa|down_3|CP012672.1_6796673_6797399_+	NA	NA|292aa|down_4|CP012672.1_6797398_6798274_+	pfam14332, DUF4388, Domain of unknown function (DUF4388)	NA|268aa|down_5|CP012672.1_6798270_6799074_+	COG1192, Soj, ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|186aa|down_6|CP012672.1_6799045_6799603_+	NA	NA|125aa|down_7|CP012672.1_6799655_6800030_+	NA	NA|198aa|down_8|CP012672.1_6800498_6801092_-	NA	NA|172aa|down_9|CP012672.1_6801091_6801607_-	NA
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	25	7552432-7552498	20	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Orphan	CGCGGGCCGGAAGCTTCCGGCGCGG	25	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	NA|107aa|up_9|CP012672.1_7542260_7542581_-,NA|71aa|up_6|CP012672.1_7545468_7545681_+,NA|223aa|up_4|CP012672.1_7546536_7547205_+,NA|100aa|down_0|CP012672.1_7552891_7553191_+,NA|262aa|down_2|CP012672.1_7554339_7555125_-	NA|107aa|up_9|CP012672.1_7542260_7542581_-	NA	NA|426aa|up_8|CP012672.1_7542654_7543932_-	cd03798, GT4_WlbH-like, Bordetella parapertussis WlbH and similar proteins	NA|413aa|up_7|CP012672.1_7544106_7545345_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|71aa|up_6|CP012672.1_7545468_7545681_+	NA	NA|261aa|up_5|CP012672.1_7545632_7546415_-	PRK00208, thiG, thiazole synthase; Reviewed	NA|223aa|up_4|CP012672.1_7546536_7547205_+	NA	NA|323aa|up_3|CP012672.1_7547191_7548160_+	pfam00413, Peptidase_M10, Matrixin	NA|371aa|up_2|CP012672.1_7548249_7549362_+	cd06142, RNaseD_exo, DEDDy 3'-5' exonuclease domain of Ribonuclease D and similar proteins	NA|587aa|up_1|CP012672.1_7549466_7551227_+	pfam00924, MS_channel, Mechanosensitive ion channel	NA|350aa|up_0|CP012672.1_7551168_7552218_-	TIGR00433, biotin_synthase, biotin synthase	NA|100aa|down_0|CP012672.1_7552891_7553191_+	NA	NA|351aa|down_1|CP012672.1_7553130_7554183_-	COG2805, PilT, Tfp pilus assembly protein, pilus retraction ATPase PilT [Cell motility and secretion / Intracellular trafficking and secretion]	NA|262aa|down_2|CP012672.1_7554339_7555125_-	NA	NA|466aa|down_3|CP012672.1_7555138_7556536_-	pfam01964, ThiC_Rad_SAM, Radical SAM ThiC family	NA|153aa|down_4|CP012672.1_7556648_7557107_-	COG0848, ExbD, Biopolymer transport protein [Intracellular trafficking and secretion]	NA|139aa|down_5|CP012672.1_7557112_7557529_-	COG0848, ExbD, Biopolymer transport protein [Intracellular trafficking and secretion]	NA|239aa|down_6|CP012672.1_7557642_7558359_-	TIGR02796, Protein_TolQ, TolQ protein	NA|261aa|down_7|CP012672.1_7558527_7559310_-	pfam03544, TonB_C, Gram-negative bacterial TonB protein C-terminal	NA|1074aa|down_8|CP012672.1_7559492_7562714_+	pfam13620, CarboxypepD_reg, Carboxypeptidase regulatory-like domain	NA|329aa|down_9|CP012672.1_7562744_7563731_+	PTZ00146, PTZ00146, fibrillarin; Provisional
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	26	7747672-7747807	5	CRT	no		cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Orphan	CCTCGACGCCGGCGCGCTCG	20	1	1	7747692-7747709	CP012672.1_1481223-1481240	NA	3	3	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	NA|309aa|up_8|CP012672.1_7736277_7737204_-,NA|491aa|up_7|CP012672.1_7737148_7738621_-,NA|271aa|up_5|CP012672.1_7739921_7740734_+,NA|198aa|up_0|CP012672.1_7746687_7747281_+,NA|138aa|down_0|CP012672.1_7748356_7748770_-,NA|60aa|down_1|CP012672.1_7748923_7749103_-,NA|194aa|down_7|CP012672.1_7754515_7755097_-	NA|1115aa|up_9|CP012672.1_7732936_7736281_-	pfam12770, CHAT, CHAT domain	NA|309aa|up_8|CP012672.1_7736277_7737204_-	NA	NA|491aa|up_7|CP012672.1_7737148_7738621_-	NA	NA|315aa|up_6|CP012672.1_7738813_7739758_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|271aa|up_5|CP012672.1_7739921_7740734_+	NA	NA|411aa|up_4|CP012672.1_7740730_7741963_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|374aa|up_3|CP012672.1_7741959_7743081_+	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|383aa|up_2|CP012672.1_7743220_7744369_+	PTZ00146, PTZ00146, fibrillarin; Provisional	NA|490aa|up_1|CP012672.1_7744773_7746243_-	pfam13676, TIR_2, TIR domain	NA|198aa|up_0|CP012672.1_7746687_7747281_+	NA	NA|138aa|down_0|CP012672.1_7748356_7748770_-	NA	NA|60aa|down_1|CP012672.1_7748923_7749103_-	NA	NA|259aa|down_2|CP012672.1_7749625_7750402_+	pfam01339, CheB_methylest, CheB methylesterase	NA|122aa|down_3|CP012672.1_7750107_7750473_+	pfam03705, CheR_N, CheR methyltransferase, all-alpha domain	NA|299aa|down_4|CP012672.1_7750498_7751395_+	cd06561, AlkD_like, A new structural DNA glycosylase	NA|183aa|down_5|CP012672.1_7751422_7751971_-	COG1633, COG1633, Uncharacterized conserved protein [Function unknown]	NA|769aa|down_6|CP012672.1_7752136_7754443_-	pfam00930, DPPIV_N, Dipeptidyl peptidase IV (DPP IV) N-terminal region	NA|194aa|down_7|CP012672.1_7754515_7755097_-	NA	NA|445aa|down_8|CP012672.1_7755093_7756428_-	pfam13304, AAA_21, AAA domain, putative AbiEii toxin, Type IV TA system	NA|850aa|down_9|CP012672.1_7756529_7759079_-	pfam12029, DUF3516, Domain of unknown function (DUF3516)
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	29	8411164-8411264	22	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Orphan	GAGACCCGACCCGGGTTGCCCCCGTAA	27	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	NA|104aa|up_9|CP012672.1_8397019_8397331_-,NA|203aa|up_1|CP012672.1_8408196_8408805_+,NA|250aa|down_0|CP012672.1_8411580_8412330_-,NA|488aa|down_3|CP012672.1_8415714_8417178_-,NA|347aa|down_5|CP012672.1_8418526_8419567_+	NA|104aa|up_9|CP012672.1_8397019_8397331_-	NA	NA|287aa|up_8|CP012672.1_8397563_8398424_+	COG3568, ElsH, Metal-dependent hydrolase [General function prediction only]	NA|303aa|up_7|CP012672.1_8398610_8399519_+	cd01561, CBS_like, CBS_like: This subgroup includes Cystathionine beta-synthase (CBS) and Cysteine synthase	NA|407aa|up_6|CP012672.1_8399506_8400727_-	COG1008, NuoM, NADH:ubiquinone oxidoreductase subunit 4 (chain M) [Energy production and conversion]	NA|480aa|up_5|CP012672.1_8401106_8402546_-	PRK06590, PRK06590, NADH:ubiquinone oxidoreductase subunit L; Reviewed	NA|1020aa|up_4|CP012672.1_8402535_8405595_-	pfam10070, DUF2309, Uncharacterized protein conserved in bacteria (DUF2309)	NA|560aa|up_3|CP012672.1_8405671_8407351_-	COG0659, SUL1, Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism]	NA|196aa|up_2|CP012672.1_8407612_8408200_+	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|203aa|up_1|CP012672.1_8408196_8408805_+	NA	NA|285aa|up_0|CP012672.1_8408953_8409808_-	PRK09562, mazG, nucleoside triphosphate pyrophosphohydrolase; Reviewed	NA|250aa|down_0|CP012672.1_8411580_8412330_-	NA	NA|417aa|down_1|CP012672.1_8412818_8414069_-	PRK07538, PRK07538, hypothetical protein; Provisional	NA|528aa|down_2|CP012672.1_8414170_8415754_+	cd00657, Ferritin_like, Ferritin-like superfamily of diiron-containing four-helix-bundle proteins	NA|488aa|down_3|CP012672.1_8415714_8417178_-	NA	NA|356aa|down_4|CP012672.1_8417177_8418245_-	sd00008, TPR_YbbN, C-terminal Tetratricopeptide repeat (TPR) region of YbbN and similar motifs	NA|347aa|down_5|CP012672.1_8418526_8419567_+	NA	NA|179aa|down_6|CP012672.1_8419610_8420147_-	cd00060, FHA, Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins	NA|704aa|down_7|CP012672.1_8420791_8422903_+	pfam03512, Glyco_hydro_52, Glycosyl hydrolase family 52	NA|346aa|down_8|CP012672.1_8422899_8423937_+	cd05265, SDR_a1, atypical (a) SDRs, subgroup 1	NA|417aa|down_9|CP012672.1_8424048_8425299_-	COG3866, PelB, Pectate lyase [Carbohydrate transport and metabolism]
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	30	8635528-8639542	6,5,23	CRT,PILER-CR,CRISPRCasFinder	no	RT	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Unclear	CTCTCCGCCGCTGAAAGGCGGNNNNCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC,CTCTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC	40,36,36	3	5	8635564-8635609|8636384-8636421|8636384-8636421|8636532-8636569|8636532-8636569	CP012672.1_676501-676546|CP012672.1_7309292-7309255|CP012672.1_7353333-7353296|CP012672.1_7314083-7314120|CP012672.1_7358124-7358161	NA:NA:NA	54,53,53	54	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	NA|314aa|up_1|CP012672.1_8633582_8634524_-,NA|36aa|up_0|CP012672.1_8634791_8634899_+,NA|425aa|down_4|CP012672.1_8646496_8647771_+,NA|383aa|down_5|CP012672.1_8647872_8649021_-	NA|236aa|up_9|CP012672.1_8616117_8616825_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|283aa|up_8|CP012672.1_8617440_8618289_+	TIGR03425, urea_degr_2, urea carboxylase-associated protein 2	NA|224aa|up_7|CP012672.1_8618285_8618957_+	TIGR03424, urea_degr_1, urea carboxylase-associated protein 1	NA|1249aa|up_6|CP012672.1_8618953_8622700_+	TIGR02712, Includes:_Allophanate_hydrolase, urea carboxylase	NA|547aa|up_5|CP012672.1_8622692_8624333_+	TIGR03428, ureacarb_perm, permease, urea carboxylase system	NA|525aa|up_4|CP012672.1_8624356_8625931_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|1929aa|up_3|CP012672.1_8626188_8631975_-	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|217aa|up_2|CP012672.1_8632830_8633481_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|314aa|up_1|CP012672.1_8633582_8634524_-	NA	NA|36aa|up_0|CP012672.1_8634791_8634899_+	NA	NA|516aa|down_0|CP012672.1_8639781_8641329_-	PTZ00146, PTZ00146, fibrillarin; Provisional	NA|543aa|down_1|CP012672.1_8641367_8642996_-	PTZ00146, PTZ00146, fibrillarin; Provisional	NA|366aa|down_2|CP012672.1_8643011_8644109_-	pfam08308, PEGA, PEGA domain	RT|451aa|down_3|CP012672.1_8644231_8645584_-	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|425aa|down_4|CP012672.1_8646496_8647771_+	NA	NA|383aa|down_5|CP012672.1_8647872_8649021_-	NA	NA|178aa|down_6|CP012672.1_8649173_8649707_+	pfam01668, SmpB, SmpB protein	NA|147aa|down_7|CP012672.1_8649804_8650245_-	PRK06500, PRK06500, SDR family oxidoreductase	NA|316aa|down_8|CP012672.1_8650410_8651358_+	COG4977, COG4977, Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain [Transcription]	NA|483aa|down_9|CP012672.1_8651389_8652838_-	PRK01642, cls, cardiolipin synthetase; Reviewed
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	31	9001635-9001725	24	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Orphan	CGGCCCGGCGCGGGGGATGGGCCGAT	26	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	NA|202aa|up_9|CP012672.1_8984349_8984955_-,NA|171aa|up_4|CP012672.1_8995858_8996371_-,NA|183aa|up_0|CP012672.1_9001007_9001556_+,NA|224aa|down_2|CP012672.1_9004973_9005645_+	NA|202aa|up_9|CP012672.1_8984349_8984955_-	NA	NA|1097aa|up_8|CP012672.1_8985006_8988297_-	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|1030aa|up_7|CP012672.1_8988631_8991721_-	PRK07003, PRK07003, DNA polymerase III subunit gamma/tau	NA|1084aa|up_6|CP012672.1_8991717_8994969_-	PRK07003, PRK07003, DNA polymerase III subunit gamma/tau	NA|171aa|up_5|CP012672.1_8995173_8995686_+	cd10028, UDG-F2_TDG_MUG, Uracil DNA glycosylase family 2, includes thymine DNA glycosylase, mismatch-specific uracil DNA glycosylase and similar proteins	NA|171aa|up_4|CP012672.1_8995858_8996371_-	NA	NA|837aa|up_3|CP012672.1_8996737_8999248_+	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]	NA|191aa|up_2|CP012672.1_8999481_9000054_+	cd07185, OmpA_C-like, Peptidoglycan binding domains similar to the C-terminal domain of outer-membrane protein OmpA	NA|164aa|up_1|CP012672.1_9000505_9000997_+	pfam12732, YtxH, YtxH-like protein	NA|183aa|up_0|CP012672.1_9001007_9001556_+	NA	NA|374aa|down_0|CP012672.1_9001864_9002986_-	cd06161, S2P-M50_SpoIVFB, SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation	NA|394aa|down_1|CP012672.1_9003392_9004574_-	pfam00924, MS_channel, Mechanosensitive ion channel	NA|224aa|down_2|CP012672.1_9004973_9005645_+	NA	NA|1951aa|down_3|CP012672.1_9006314_9012167_+	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional	NA|169aa|down_4|CP012672.1_9012424_9012931_+	COG3216, COG3216, Uncharacterized protein conserved in bacteria [Function unknown]	NA|619aa|down_5|CP012672.1_9013134_9014991_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|414aa|down_6|CP012672.1_9015361_9016603_-	COG1502, Cls, Phosphatidylserine/phosphatidylglycerophosphate/cardiolipin synthases and related enzymes [Lipid metabolism]	NA|495aa|down_7|CP012672.1_9016863_9018348_-	pfam10092, DUF2330, Uncharacterized protein conserved in bacteria (DUF2330)	NA|486aa|down_8|CP012672.1_9018629_9020087_-	PRK05335, PRK05335, tRNA (uracil-5-)-methyltransferase Gid; Reviewed	NA|866aa|down_9|CP012672.1_9020121_9022719_-	PRK06599, PRK06599, DNA topoisomerase I; Validated
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	32	9280094-9280422	25,6	CRISPRCasFinder,PILER-CR	no		cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Orphan	GGCTTCAATGGGGCCGCCGCCTCTCAGCGGCGGAA,TTTCCGCCGCTGAAAGGCGGCGGCCCCATTGAAGC	35,35	0	0	NA	NA	NA:NA	4,3	4	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	NA|361aa|up_8|CP012672.1_9268061_9269144_-,NA|189aa|up_5|CP012672.1_9272443_9273010_+,NA|210aa|up_1|CP012672.1_9277793_9278423_-,NA|338aa|up_0|CP012672.1_9278562_9279576_-,NA|264aa|down_0|CP012672.1_9281007_9281799_+,NA|143aa|down_2|CP012672.1_9282436_9282865_+,NA|67aa|down_3|CP012672.1_9282904_9283105_+,NA|349aa|down_6|CP012672.1_9285547_9286594_+	NA|149aa|up_9|CP012672.1_9267618_9268065_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|361aa|up_8|CP012672.1_9268061_9269144_-	NA	NA|337aa|up_7|CP012672.1_9269130_9270141_-	cd04176, Rap2, Rap2 family GTPase consists of Rap2a, Rap2b, and Rap2c	NA|503aa|up_6|CP012672.1_9270740_9272249_+	smart00637, CBD_II, CBD_II domain	NA|189aa|up_5|CP012672.1_9272443_9273010_+	NA	NA|542aa|up_4|CP012672.1_9273279_9274905_-	cd07041, STAS_RsbR_RsbS_like, Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation	NA|321aa|up_3|CP012672.1_9275424_9276387_-	cd05292, LDH_2, A subgroup of L-lactate dehydrogenases	NA|403aa|up_2|CP012672.1_9276536_9277745_-	pfam00924, MS_channel, Mechanosensitive ion channel	NA|210aa|up_1|CP012672.1_9277793_9278423_-	NA	NA|338aa|up_0|CP012672.1_9278562_9279576_-	NA	NA|264aa|down_0|CP012672.1_9281007_9281799_+	NA	NA|149aa|down_1|CP012672.1_9281902_9282349_+	COG1832, COG1832, Predicted CoA-binding protein [General function prediction only]	NA|143aa|down_2|CP012672.1_9282436_9282865_+	NA	NA|67aa|down_3|CP012672.1_9282904_9283105_+	NA	NA|152aa|down_4|CP012672.1_9283438_9283894_-	pfam07883, Cupin_2, Cupin domain	NA|343aa|down_5|CP012672.1_9284172_9285201_+	TIGR04470, hypothetical_protein_ALIPUT_00462, radical SAM mobile pair protein B	NA|349aa|down_6|CP012672.1_9285547_9286594_+	NA	NA|971aa|down_7|CP012672.1_9286590_9289503_+	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|260aa|down_8|CP012672.1_9289904_9290684_+	COG5000, NtrY, Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation [Signal transduction mechanisms]	NA|153aa|down_9|CP012672.1_9290709_9291168_+	cd07176, terB, tellurite resistance protein terB
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	36	9865764-9865842	29	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Orphan	CGTCCCCGGGTGCCGCAGCGCGGG	24	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	NA|70aa|up_6|CP012672.1_9857478_9857688_+,NA|209aa|down_1|CP012672.1_9867089_9867716_-	NA|592aa|up_9|CP012672.1_9854228_9856004_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|148aa|up_8|CP012672.1_9856194_9856638_-	pfam06094, GGACT, Gamma-glutamyl cyclotransferase, AIG2-like	NA|130aa|up_7|CP012672.1_9857038_9857428_+	pfam03544, TonB_C, Gram-negative bacterial TonB protein C-terminal	NA|70aa|up_6|CP012672.1_9857478_9857688_+	NA	NA|447aa|up_5|CP012672.1_9857889_9859230_+	cd07041, STAS_RsbR_RsbS_like, Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation	NA|224aa|up_4|CP012672.1_9859669_9860341_-	pfam09536, DUF2378, Protein of unknown function (DUF2378)	NA|132aa|up_3|CP012672.1_9861026_9861422_+	cd07264, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|250aa|up_2|CP012672.1_9861614_9862364_+	pfam13539, Peptidase_M15_4, D-alanyl-D-alanine carboxypeptidase	NA|163aa|up_1|CP012672.1_9862400_9862889_-	COG1607, COG1607, Acyl-CoA hydrolase [Lipid metabolism]	NA|941aa|up_0|CP012672.1_9862926_9865749_-	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|393aa|down_0|CP012672.1_9865847_9867026_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|209aa|down_1|CP012672.1_9867089_9867716_-	NA	NA|455aa|down_2|CP012672.1_9867863_9869228_-	PRK12678, PRK12678, transcription termination factor Rho; Provisional	NA|260aa|down_3|CP012672.1_9869375_9870155_+	COG1788, AtoD, Acyl CoA:acetate/3-ketoacid CoA transferase, alpha subunit [Lipid metabolism]	NA|222aa|down_4|CP012672.1_9870180_9870846_+	TIGR02428, 3-oxoadipate_CoA-transferase_subunit_B, 3-oxoacid CoA-transferase, B subunit	NA|286aa|down_5|CP012672.1_9870856_9871714_+	PRK05654, PRK05654, acetyl-CoA carboxylase carboxyltransferase subunit beta	NA|441aa|down_6|CP012672.1_9871710_9873033_+	COG0285, FolC, Folylpolyglutamate synthase [Coenzyme metabolism]	NA|351aa|down_7|CP012672.1_9873117_9874170_+	cd05233, SDR_c, classical (c) SDRs	NA|481aa|down_8|CP012672.1_9874335_9875778_+	PRK14314, glmM, phosphoglucosamine mutase; Provisional	NA|335aa|down_9|CP012672.1_9876002_9877007_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	38	10198365-10198452	30	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Orphan	CAAAGGCGCAAAGGCGCGAGAAGA	24	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	NA|408aa|up_4|CP012672.1_10194123_10195347_-,NA|82aa|up_2|CP012672.1_10196693_10196939_+,NA|95aa|down_1|CP012672.1_10200265_10200550_-,NA|120aa|down_2|CP012672.1_10200742_10201102_+,NA|209aa|down_5|CP012672.1_10203076_10203703_+,NA|602aa|down_6|CP012672.1_10203919_10205725_+	NA|129aa|up_9|CP012672.1_10189770_10190157_-	cd12399, RRM_HP0827_like, RNA recognition motif in Helicobacter pylori HP0827 protein and similar proteins	NA|95aa|up_8|CP012672.1_10190645_10190930_+	pfam07883, Cupin_2, Cupin domain	NA|310aa|up_7|CP012672.1_10190911_10191841_-	PRK00281, PRK00281, undecaprenyl-diphosphate phosphatase	NA|441aa|up_6|CP012672.1_10191892_10193215_-	pfam00561, Abhydrolase_1, alpha/beta hydrolase fold	NA|253aa|up_5|CP012672.1_10193368_10194127_-	cd00657, Ferritin_like, Ferritin-like superfamily of diiron-containing four-helix-bundle proteins	NA|408aa|up_4|CP012672.1_10194123_10195347_-	NA	NA|222aa|up_3|CP012672.1_10196031_10196697_+	cd17569, REC_HupR-like, phosphoacceptor receiver (REC) domain of hydrogen uptake protein regulator (HupR) and similar domains	NA|82aa|up_2|CP012672.1_10196693_10196939_+	NA	NA|155aa|up_1|CP012672.1_10196827_10197292_-	cd07246, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|286aa|up_0|CP012672.1_10197383_10198241_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|524aa|down_0|CP012672.1_10198731_10200303_+	pfam07627, PSCyt3, Protein of unknown function (DUF1588)	NA|95aa|down_1|CP012672.1_10200265_10200550_-	NA	NA|120aa|down_2|CP012672.1_10200742_10201102_+	NA	NA|193aa|down_3|CP012672.1_10201389_10201968_+	COG0605, SodA, Superoxide dismutase [Inorganic ion transport and metabolism]	NA|326aa|down_4|CP012672.1_10201993_10202971_-	pfam14394, DUF4423, Domain of unknown function (DUF4423)	NA|209aa|down_5|CP012672.1_10203076_10203703_+	NA	NA|602aa|down_6|CP012672.1_10203919_10205725_+	NA	NA|160aa|down_7|CP012672.1_10205897_10206377_+	cd01285, nucleoside_deaminase, Nucleoside deaminases include adenosine, guanine and cytosine deaminases	NA|373aa|down_8|CP012672.1_10206455_10207574_-	pfam04116, FA_hydroxylase, Fatty acid hydroxylase superfamily	NA|551aa|down_9|CP012672.1_10208034_10209687_+	COG3934, COG3934, Endo-beta-mannanase [Carbohydrate transport and metabolism]
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	40	10642538-10642612	32	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Orphan	GCCGGCGATATGCCCCGCGCGCGCC	25	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	NA|209aa|up_2|CP012672.1_10640367_10640994_-,NA|277aa|up_1|CP012672.1_10641317_10642148_+,NA|279aa|down_6|CP012672.1_10651553_10652390_+,NA|95aa|down_9|CP012672.1_10653969_10654254_-	NA|142aa|up_9|CP012672.1_10632371_10632797_+	COG3795, COG3795, Uncharacterized protein conserved in bacteria [Function unknown]	NA|432aa|up_8|CP012672.1_10632872_10634168_+	COG4941, COG4941, Predicted RNA polymerase sigma factor containing a TPR repeat domain [Transcription]	NA|276aa|up_7|CP012672.1_10634180_10635008_-	cd06260, DUF820, Domain of unknown function (DUF820)	NA|444aa|up_6|CP012672.1_10635120_10636452_-	PRK06062, PRK06062, hypothetical protein; Provisional	NA|158aa|up_5|CP012672.1_10636813_10637287_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|469aa|up_4|CP012672.1_10637305_10638712_-	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|400aa|up_3|CP012672.1_10638832_10640032_-	PRK00413, thrS, threonyl-tRNA synthetase; Reviewed	NA|209aa|up_2|CP012672.1_10640367_10640994_-	NA	NA|277aa|up_1|CP012672.1_10641317_10642148_+	NA	NA|111aa|up_0|CP012672.1_10642164_10642497_-	COG3357, COG3357, Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]	NA|853aa|down_0|CP012672.1_10642776_10645335_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|612aa|down_1|CP012672.1_10645655_10647491_+	PTZ00146, PTZ00146, fibrillarin; Provisional	NA|261aa|down_2|CP012672.1_10647550_10648333_-	COG2226, UbiE, Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism]	NA|280aa|down_3|CP012672.1_10648424_10649264_+	cd06124, cupin_NimR-like_N, AraC/XylS family transcriptional regulators similar to NimR, N-terminal cupin domain	NA|294aa|down_4|CP012672.1_10649349_10650231_-	COG1360, MotB, Flagellar motor protein [Cell motility and secretion]	NA|339aa|down_5|CP012672.1_10650511_10651528_+	COG0473, LeuB, Isocitrate/isopropylmalate dehydrogenase [Amino acid transport and metabolism]	NA|279aa|down_6|CP012672.1_10651553_10652390_+	NA	NA|270aa|down_7|CP012672.1_10652418_10653228_-	PRK12323, PRK12323, DNA polymerase III subunit gamma/tau	NA|282aa|down_8|CP012672.1_10653224_10654070_-	NF033154, endonuc_SmrA, DNA endonuclease SmrA	NA|95aa|down_9|CP012672.1_10653969_10654254_-	NA
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	44	11068389-11068472	35	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Orphan	ATCAATACATCTTTGCAGTTCTTCACTC	28	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	NA|551aa|up_1|CP012672.1_11062055_11063708_+,NA	NA|463aa|up_9|CP012672.1_11051379_11052768_+	pfam13701, DDE_Tnp_1_4, Transposase DDE domain group 1	NA|617aa|up_8|CP012672.1_11053479_11055330_-	pfam02956, TT_ORF1, TT viral orf 1	NA|389aa|up_7|CP012672.1_11055404_11056571_+	cd00519, Lipase_3, Lipase (class 3)	NA|392aa|up_6|CP012672.1_11056619_11057795_+	cd00519, Lipase_3, Lipase (class 3)	NA|269aa|up_5|CP012672.1_11057862_11058669_-	COG3694, COG3694, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|270aa|up_4|CP012672.1_11058691_11059501_-	COG4587, COG4587, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|330aa|up_3|CP012672.1_11059497_11060487_-	cd03267, ABC_NatA_like, ATP-binding cassette domain of an uncharacterized transporter similar in sequence to NatA	NA|327aa|up_2|CP012672.1_11060620_11061601_+	cd05120, APH_ChoK_like, Aminoglycoside 3'-phosphotransferase and Choline Kinase family	NA|551aa|up_1|CP012672.1_11062055_11063708_+	NA	NA|994aa|up_0|CP012672.1_11063783_11066765_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|2526aa|down_0|CP012672.1_11068796_11076374_+	PRK05691, PRK05691, peptide synthase; Validated	NA|658aa|down_1|CP012672.1_11076418_11078392_+	COG2192, COG2192, Predicted carbamoyl transferase, NodU family [Posttranslational modification, protein turnover, chaperones]	NA|413aa|down_2|CP012672.1_11078433_11079672_+	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|419aa|down_3|CP012672.1_11079820_11081077_+	pfam03712, Cu2_monoox_C, Copper type II ascorbate-dependent monooxygenase, C-terminal domain	NA|441aa|down_4|CP012672.1_11082382_11083705_+	cd07041, STAS_RsbR_RsbS_like, Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation	NA|524aa|down_5|CP012672.1_11083982_11085554_+	pfam00932, LTD, Lamin Tail Domain	NA|539aa|down_6|CP012672.1_11085750_11087367_+	COG5520, COG5520, O-Glycosyl hydrolase [Cell envelope biogenesis, outer membrane]	NA|222aa|down_7|CP012672.1_11087405_11088071_-	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|326aa|down_8|CP012672.1_11088213_11089191_-	COG1183, PssA, Phosphatidylserine synthase [Lipid metabolism]	NA|119aa|down_9|CP012672.1_11089387_11089744_-	COG2329, COG2329, Uncharacterized enzyme involved in biosynthesis of extracellular polysaccharides [General function prediction only]
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	45	11381852-11381963	36	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Orphan	CCCGCCCATGTCCGACATGCCCGTCGC	27	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	NA|190aa|up_9|CP012672.1_11361588_11362158_-,NA|138aa|up_5|CP012672.1_11366526_11366940_-,NA|190aa|up_4|CP012672.1_11366975_11367545_-,NA|161aa|up_2|CP012672.1_11369459_11369942_-,NA|269aa|up_0|CP012672.1_11380957_11381764_+,NA|45aa|down_1|CP012672.1_11383854_11383989_-,NA|164aa|down_3|CP012672.1_11385497_11385989_-	NA|190aa|up_9|CP012672.1_11361588_11362158_-	NA	NA|414aa|up_8|CP012672.1_11362278_11363520_-	TIGR03696, tRNA_nuclease_WapA, RHS repeat-associated core domain	NA|718aa|up_7|CP012672.1_11363886_11366040_-	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|115aa|up_6|CP012672.1_11366185_11366530_-	pfam05717, TnpB_IS66, IS66 Orf2 like protein	NA|138aa|up_5|CP012672.1_11366526_11366940_-	NA	NA|190aa|up_4|CP012672.1_11366975_11367545_-	NA	NA|418aa|up_3|CP012672.1_11368179_11369433_-	TIGR03696, tRNA_nuclease_WapA, RHS repeat-associated core domain	NA|161aa|up_2|CP012672.1_11369459_11369942_-	NA	NA|3553aa|up_1|CP012672.1_11370017_11380676_-	pfam03534, SpvB, Salmonella virulence plasmid 65kDa B protein	NA|269aa|up_0|CP012672.1_11380957_11381764_+	NA	NA|286aa|down_0|CP012672.1_11382396_11383254_-	pfam13517, VCBS, Repeat domain in Vibrio, Colwellia, Bradyrhizobium and Shewanella	NA|45aa|down_1|CP012672.1_11383854_11383989_-	NA	NA|487aa|down_2|CP012672.1_11383988_11385449_-	pfam08308, PEGA, PEGA domain	NA|164aa|down_3|CP012672.1_11385497_11385989_-	NA	NA|1239aa|down_4|CP012672.1_11385985_11389702_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|664aa|down_5|CP012672.1_11389698_11391690_-	pfam12770, CHAT, CHAT domain	NA|493aa|down_6|CP012672.1_11391950_11393429_+	pfam08308, PEGA, PEGA domain	NA|936aa|down_7|CP012672.1_11393428_11396236_+	pfam13517, VCBS, Repeat domain in Vibrio, Colwellia, Bradyrhizobium and Shewanella	NA|242aa|down_8|CP012672.1_11396235_11396961_+	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|569aa|down_9|CP012672.1_11397028_11398735_+	cd07302, CHD, cyclase homology domain
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	46	11774625-11774818	37	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Orphan	GCCCCAACCTGGGTCGGCTGACGGCCATCCGTCGAAGCCGCTCTCGCCCC	50	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	NA|230aa|up_1|CP012672.1_11772316_11773006_-,NA|282aa|down_3|CP012672.1_11779214_11780060_+,NA|79aa|down_7|CP012672.1_11785008_11785245_-	NA|390aa|up_9|CP012672.1_11762985_11764155_+	cd05281, TDH, Threonine dehydrogenase	NA|397aa|up_8|CP012672.1_11764160_11765351_-	PRK06939, PRK06939, 2-amino-3-ketobutyrate coenzyme A ligase; Provisional	NA|360aa|up_7|CP012672.1_11765627_11766707_+	cd14962, NHL_like_6, Uncharacterized NHL-repeat domain in bacterial proteins	NA|471aa|up_6|CP012672.1_11766739_11768152_+	pfam14224, DUF4331, Domain of unknown function (DUF4331)	NA|273aa|up_5|CP012672.1_11768263_11769082_-	PRK14959, PRK14959, DNA polymerase III subunits gamma and tau; Provisional	NA|466aa|up_4|CP012672.1_11769261_11770659_+	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|154aa|up_3|CP012672.1_11770732_11771194_-	COG5373, COG5373, Predicted membrane protein [Function unknown]	NA|355aa|up_2|CP012672.1_11771208_11772273_+	cd09083, EEP-1, Exonuclease-Endonuclease-Phosphatase domain; uncharacterized family 1	NA|230aa|up_1|CP012672.1_11772316_11773006_-	NA	NA|342aa|up_0|CP012672.1_11773262_11774288_-	cd10918, CE4_NodB_like_5s_6s, Putative catalytic NodB homology domain of PgaB, IcaB, and similar proteins which consist of a deformed (beta/alpha)8 barrel fold with 5- or 6-strands	NA|187aa|down_0|CP012672.1_11774856_11775417_+	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|246aa|down_1|CP012672.1_11775382_11776120_-	COG1040, ComFC, Predicted amidophosphoribosyltransferases [General function prediction only]	NA|982aa|down_2|CP012672.1_11776272_11779218_+	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|282aa|down_3|CP012672.1_11779214_11780060_+	NA	NA|490aa|down_4|CP012672.1_11780269_11781739_+	pfam13372, Alginate_exp, Alginate export	NA|649aa|down_5|CP012672.1_11781898_11783845_-	TIGR01241, ATP-dependent_zinc_metalloprotease_FtsH, ATP-dependent metalloprotease FtsH	NA|335aa|down_6|CP012672.1_11784007_11785012_-	cd01992, PP-ATPase, N-terminal domain of predicted ATPase of the PP-loop faimly implicated in cell cycle control [Cell division and chromosome partitioning]	NA|79aa|down_7|CP012672.1_11785008_11785245_-	NA	NA|461aa|down_8|CP012672.1_11785395_11786778_-	COG2265, TrmA, SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase [Translation, ribosomal structure and biogenesis]	NA|751aa|down_9|CP012672.1_11786774_11789027_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	48	12240374-12240571	9	CRT	no		cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Orphan	GGCGCCTCGGAGACCCGCGAAGCCGCGGGCGCCTCC	36	0	0	NA	NA	NA	3	3	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	NA,NA|260aa|down_0|CP012672.1_12241184_12241964_-,NA|206aa|down_2|CP012672.1_12243710_12244328_+,NA|177aa|down_7|CP012672.1_12251883_12252414_-,NA|83aa|down_9|CP012672.1_12253059_12253308_+	NA|367aa|up_9|CP012672.1_12226643_12227744_-	pfam04015, DUF362, Domain of unknown function (DUF362)	NA|215aa|up_8|CP012672.1_12228129_12228774_+	pfam12836, HHH_3, Helix-hairpin-helix motif	NA|496aa|up_7|CP012672.1_12229180_12230668_+	COG0773, MurC, UDP-N-acetylmuramate-alanine ligase [Cell envelope biogenesis, outer membrane]	NA|284aa|up_6|CP012672.1_12230664_12231516_+	cd01639, IMPase, IMPase, inositol monophosphatase and related domains	NA|229aa|up_5|CP012672.1_12232554_12233241_+	cd04221, MauL, Methylamine utilization protein MauL	NA|587aa|up_4|CP012672.1_12233244_12235005_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|521aa|up_3|CP012672.1_12235039_12236602_+	pfam03349, Toluene_X, Outer membrane protein transport protein (OMPP1/FadL/TodX)	NA|320aa|up_2|CP012672.1_12236725_12237685_+	TIGR01292, Thioredoxin_reductase, thioredoxin-disulfide reductase	NA|351aa|up_1|CP012672.1_12237695_12238748_+	cd10030, UDG-F4_TTUDGA_SPO1dp_like, Uracil DNA glycosylase family 4, includes Thermotoga maritima TTUDGA, Bacillus phage SPO1 DNA polymerase, and similar proteins	NA|190aa|up_0|CP012672.1_12238749_12239319_+	pfam09858, DUF2085, Predicted membrane protein (DUF2085)	NA|260aa|down_0|CP012672.1_12241184_12241964_-	NA	NA|538aa|down_1|CP012672.1_12242026_12243640_+	cd05941, MCS, Malonyl-CoA synthetase (MCS)	NA|206aa|down_2|CP012672.1_12243710_12244328_+	NA	NA|701aa|down_3|CP012672.1_12244464_12246567_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|914aa|down_4|CP012672.1_12246580_12249322_-	COG5635, COG5635, Predicted NTPase (NACHT family) [Signal transduction mechanisms]	NA|392aa|down_5|CP012672.1_12249318_12250494_-	COG2865, COG2865, Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen [Transcription]	NA|439aa|down_6|CP012672.1_12250567_12251884_-	pfam04519, Bactofilin, Polymer-forming cytoskeletal	NA|177aa|down_7|CP012672.1_12251883_12252414_-	NA	NA|199aa|down_8|CP012672.1_12252416_12253013_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|83aa|down_9|CP012672.1_12253059_12253308_+	NA
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	49	12429523-12429637	39	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Orphan	CGCCAACCTGGGTGGGTAGTCCTCAACCTGGGTCGGGTGGTT	42	0	0	NA	NA	NA	1	1	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	NA|207aa|up_9|CP012672.1_12413116_12413737_+,NA|65aa|up_4|CP012672.1_12422806_12423001_+,NA|809aa|up_1|CP012672.1_12425681_12428108_+,NA|121aa|down_3|CP012672.1_12432998_12433361_+,NA|129aa|down_8|CP012672.1_12437934_12438321_+	NA|207aa|up_9|CP012672.1_12413116_12413737_+	NA	NA|508aa|up_8|CP012672.1_12413811_12415335_-	cd17346, MFS_DtpA_like, Dipeptide and tripeptide permease A (DtpA)-like subfamily of the Major Facilitator Superfamily of transporters	NA|402aa|up_7|CP012672.1_12415979_12417185_+	PRK00236, xerC, site-specific tyrosine recombinase XerC; Reviewed	NA|768aa|up_6|CP012672.1_12417181_12419485_+	cd16148, sulfatase_like, uncharacterized sulfatase subfamily	NA|737aa|up_5|CP012672.1_12419986_12422197_-	cd04280, ZnMc_astacin_like, Zinc-dependent metalloprotease, astacin_like subfamily or peptidase family M12A, a group of zinc-dependent proteolytic enzymes with a HExxH zinc-binding site/active site	NA|65aa|up_4|CP012672.1_12422806_12423001_+	NA	NA|283aa|up_3|CP012672.1_12423020_12423869_-	pfam03781, FGE-sulfatase, Sulfatase-modifying factor enzyme 1	NA|362aa|up_2|CP012672.1_12424585_12425671_+	pfam07728, AAA_5, AAA domain (dynein-related subfamily)	NA|809aa|up_1|CP012672.1_12425681_12428108_+	NA	NA|288aa|up_0|CP012672.1_12428406_12429270_-	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|319aa|down_0|CP012672.1_12429686_12430643_-	PRK14864, PRK14864, biofilm peroxide resistance protein BsmA	NA|350aa|down_1|CP012672.1_12430798_12431848_-	cd08417, PBP2_Nitroaromatics_like, The C-terminal substrate binding domain of LysR-type transcriptional regulators that involved in the catabolism of nitroaromatic/naphthalene compounds and that of related regulators; contains the type 2 periplasmic binding fold	NA|107aa|down_2|CP012672.1_12431883_12432204_+	pfam03992, ABM, Antibiotic biosynthesis monooxygenase	NA|121aa|down_3|CP012672.1_12432998_12433361_+	NA	NA|488aa|down_4|CP012672.1_12433479_12434943_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|462aa|down_5|CP012672.1_12434939_12436325_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|223aa|down_6|CP012672.1_12436555_12437224_+	pfam03682, UPF0158, Uncharacterized protein family (UPF0158)	NA|183aa|down_7|CP012672.1_12437334_12437883_+	cd07185, OmpA_C-like, Peptidoglycan binding domains similar to the C-terminal domain of outer-membrane protein OmpA	NA|129aa|down_8|CP012672.1_12437934_12438321_+	NA	NA|339aa|down_9|CP012672.1_12438416_12439433_-	pfam08308, PEGA, PEGA domain
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	51	13553893-13554165	40	CRISPRCasFinder	no		cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Orphan	CGGCGTCCCCGCGCGGCGCGGCG	23	0	0	NA	NA	NA	4	4	Orphan	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	NA|345aa|up_7|CP012672.1_13535466_13536501_+,NA|72aa|down_1|CP012672.1_13556030_13556246_-,NA|78aa|down_6|CP012672.1_13568150_13568384_+,NA|75aa|down_7|CP012672.1_13568392_13568617_-,NA|55aa|down_9|CP012672.1_13569835_13570000_-	NA|275aa|up_9|CP012672.1_13532883_13533708_+	cd01651, RT_G2_intron, RT_G2_intron: Reverse transcriptases (RTs) with group II intron origin	NA|470aa|up_8|CP012672.1_13533559_13534969_-	PTZ00048, PTZ00048, cytochrome c; Provisional	NA|345aa|up_7|CP012672.1_13535466_13536501_+	NA	NA|418aa|up_6|CP012672.1_13536549_13537803_+	pfam07228, SpoIIE, Stage II sporulation protein E (SpoIIE)	NA|1647aa|up_5|CP012672.1_13537822_13542763_+	pfam10442, FIST_C, FIST C domain	NA|1156aa|up_4|CP012672.1_13542830_13546298_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|593aa|up_3|CP012672.1_13546562_13548341_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|337aa|up_2|CP012672.1_13549756_13550767_-	PHA00370, III, attachment protein	NA|396aa|up_1|CP012672.1_13550930_13552118_-	cd19920, REC_PA4781-like, phosphoacceptor receiver (REC) domain of cyclic di-GMP phosphodiesterase PA4781 and similar domains	NA|555aa|up_0|CP012672.1_13552114_13553779_-	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|227aa|down_0|CP012672.1_13555283_13555964_-	pfam04116, FA_hydroxylase, Fatty acid hydroxylase superfamily	NA|72aa|down_1|CP012672.1_13556030_13556246_-	NA	NA|939aa|down_2|CP012672.1_13556511_13559328_+	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|128aa|down_3|CP012672.1_13559435_13559819_+	pfam12680, SnoaL_2, SnoaL-like domain	NA|558aa|down_4|CP012672.1_13559961_13561635_-	cd13563, PBP2_SsuA_like_6, Putative substrate binding domain of sulfonate binding protein-like, a member of the type 2 periplasmic binding protein fold	NA|773aa|down_5|CP012672.1_13561715_13564034_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|78aa|down_6|CP012672.1_13568150_13568384_+	NA	NA|75aa|down_7|CP012672.1_13568392_13568617_-	NA	NA|100aa|down_8|CP012672.1_13568515_13568815_+	pfam13817, DDE_Tnp_IS66_C, IS66 C-terminal element	NA|55aa|down_9|CP012672.1_13569835_13570000_-	NA
GCA_004135755.1_ASM413575v1	CP012672	Sorangium cellulosum strain So ce836 chromosome, complete genome	52	14320980-14324885	11,7,41,8,9	CRT,PILER-CR,CRISPRCasFinder,PILER-CR,PILER-CR	no	cas6,cas3,cas8b3,cas7,cas5,cas1,cas2	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	Unclear	CCGNTCCCCGCCGTGATGCCGGAAGGCGTTGAGCAC,CCCCGCCGTGATGCCGGAAGGCGTTGAGCAC,TGATGCCGGAAGGCGTTGAGCAC,CCGATCCCCGCCGTGATGCCGGAAGGCGTTGAGCAC,CCCCGCCGTGATGCCGGAAGGCGTTGAGCAC	36,31,23,36,31	0	0	NA	NA	I-A,I-B,II-B:I-A,I-B,II-B:I-A,I-B,II-B:I-A,I-B,II-B:I-A,I-B,II-B	54,50,54,50,50	54	Unclear	cas8u1,cas3,csb2gr5,csb1gr7,csa3,RT,DEDDh,WYL,PD-DExK,DinG,cas6,cas8b3,cas7,cas5,cas1,cas2	NA|291aa|up_8|CP012672.1_14310561_14311434_-,NA|252aa|up_7|CP012672.1_14311624_14312380_+,NA|88aa|down_0|CP012672.1_14324921_14325185_-,NA|328aa|down_8|CP012672.1_14340532_14341516_+,NA|148aa|down_9|CP012672.1_14341625_14342069_+	NA|217aa|up_9|CP012672.1_14309837_14310488_-	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|291aa|up_8|CP012672.1_14310561_14311434_-	NA	NA|252aa|up_7|CP012672.1_14311624_14312380_+	NA	cas6|210aa|up_6|CP012672.1_14313014_14313644_+	pfam09559, Cas6, Cas6 Crispr	cas3|821aa|up_5|CP012672.1_14313643_14316106_+	TIGR01587, CRISPR-associated_endonuclease/helicase_Cas3, CRISPR-associated helicase Cas3	cas8b3|518aa|up_4|CP012672.1_14316090_14317644_+	TIGR03485, hypothetical_protein_L8106_30105, CRISPR-associated protein Cas8a1/Csx13, MYXAN subtype	cas7|338aa|up_3|CP012672.1_14317647_14318661_+	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas5|225aa|up_2|CP012672.1_14318657_14319332_+	cd09688, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas1|514aa|up_1|CP012672.1_14319165_14320707_+	TIGR03983, hypothetical_protein_LA3181, CRISPR-associated endonuclease Cas1, subtype MYXAN	cas2|99aa|up_0|CP012672.1_14320435_14320732_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|88aa|down_0|CP012672.1_14324921_14325185_-	NA	NA|129aa|down_1|CP012672.1_14325578_14325965_-	pfam14430, Imm1, Immunity protein Imm1	NA|2170aa|down_2|CP012672.1_14325980_14332490_-	TIGR03696, tRNA_nuclease_WapA, RHS repeat-associated core domain	NA|500aa|down_3|CP012672.1_14332970_14334470_+	cd00657, Ferritin_like, Ferritin-like superfamily of diiron-containing four-helix-bundle proteins	NA|398aa|down_4|CP012672.1_14334711_14335905_+	cd07041, STAS_RsbR_RsbS_like, Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation	NA|310aa|down_5|CP012672.1_14336134_14337064_+	pfam13628, DUF4142, Domain of unknown function (DUF4142)	NA|481aa|down_6|CP012672.1_14337590_14339033_-	cd07115, ALDH_HMSADH_HapE, Pseudomonas fluorescens 4-hydroxymuconic semialdehyde dehydrogenase-like	NA|487aa|down_7|CP012672.1_14339075_14340536_+	TIGR00699, 4-aminobutyrate_aminotransferase, 4-aminobutyrate aminotransferase, eukaryotic type	NA|328aa|down_8|CP012672.1_14340532_14341516_+	NA	NA|148aa|down_9|CP012672.1_14341625_14342069_+	NA
