assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000346065.1_ASM34606v1	NC_020528	Sinorhizobium meliloti 2011, complete sequence	1	174960-175106	1	CRISPRCasFinder	no		DEDDh,WYL,cas3,csa3	Orphan	TCTCCCCGCATGCGGGGAGAAGGG	24	0	0	NA	NA	NA	2	2	Orphan	DEDDh,WYL,cas3,csa3,RT	NA,NA	NA|512aa|up_9|NC_020528.1_162450_163986_+	COG4964, CpaC, Flp pilus assembly protein, secretin CpaC [Intracellular trafficking and secretion]	NA|227aa|up_8|NC_020528.1_164042_164723_+	COG5461, COG5461, Type IV pili component [Cell motility and secretion]	NA|490aa|up_7|NC_020528.1_166094_167564_+	COG4962, CpaF, Flp pilus assembly protein, ATPase CpaF [Intracellular trafficking and secretion]	NA|337aa|up_6|NC_020528.1_167622_168633_+	COG4965, TadB, Flp pilus assembly protein TadB [Intracellular trafficking and secretion]	NA|328aa|up_5|NC_020528.1_168640_169624_+	COG2064, TadC, Flp pilus assembly protein TadC [Cell motility and secretion / Intracellular trafficking and secretion]	NA|388aa|up_4|NC_020528.1_169852_171016_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|270aa|up_3|NC_020528.1_171273_172083_-	COG5010, TadD, Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion]	NA|464aa|up_2|NC_020528.1_172213_173605_+	COG0260, PepB, Leucyl aminopeptidase [Amino acid transport and metabolism]	NA|115aa|up_1|NC_020528.1_173692_174037_+	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|285aa|up_0|NC_020528.1_174045_174900_+	COG0791, Spr, Cell wall-associated hydrolases (invasion-associated proteins) [Cell envelope biogenesis, outer membrane]	NA|320aa|down_0|NC_020528.1_175136_176096_-	cd12164, GDH_like_2, Putative glycerate dehydrogenase and related proteins of the D-specific 2-hydroxy dehydrogenase family	NA|543aa|down_1|NC_020528.1_176106_177735_-	COG4172, COG4172, ABC-type uncharacterized transport system, duplicated ATPase component [General function prediction only]	NA|378aa|down_2|NC_020528.1_177743_178877_-	COG4239, COG4239, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|361aa|down_3|NC_020528.1_178876_179959_-	COG4174, COG4174, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|620aa|down_4|NC_020528.1_180139_181999_-	cd08497, PBP2_NikA_DppA_OppA_like_14, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|356aa|down_5|NC_020528.1_182290_183358_+	COG3770, MepA, Murein endopeptidase [Cell envelope biogenesis, outer membrane]	NA|127aa|down_6|NC_020528.1_183478_183859_+	COG1803, MgsA, Methylglyoxal synthase [Carbohydrate transport and metabolism]	NA|340aa|down_7|NC_020528.1_183920_184940_+	PRK00292, glk, glucokinase; Provisional	NA|602aa|down_8|NC_020528.1_185048_186854_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|273aa|down_9|NC_020528.1_186871_187690_+	COG0289, DapB, Dihydrodipicolinate reductase [Amino acid transport and metabolism]
GCF_000346065.1_ASM34606v1	NC_020528	Sinorhizobium meliloti 2011, complete sequence	2	1131918-1132028	2	CRISPRCasFinder	no		DEDDh,WYL,cas3,csa3	Orphan	GGGCGAAGGGCCGGTCTATCCGAACATGTGAATGT	35	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,cas3,csa3,RT	NA|52aa|up_3|NC_020528.1_1127283_1127439_-,NA|59aa|up_2|NC_020528.1_1127476_1127653_-,NA	NA|466aa|up_9|NC_020528.1_1114617_1116015_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|987aa|up_8|NC_020528.1_1116049_1119010_+	PRK14108, PRK14108, bifunctional [glutamine synthetase] adenylyltransferase/[glutamine synthetase]-adenylyl-L-tyrosine phosphorylase	NA|776aa|up_7|NC_020528.1_1119020_1121348_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|885aa|up_6|NC_020528.1_1121701_1124356_-	PRK14015, pepN, aminopeptidase N; Provisional	NA|313aa|up_5|NC_020528.1_1124781_1125720_+	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|430aa|up_4|NC_020528.1_1125729_1127019_-	cd17477, MFS_YcaD_like, YcaD and similar transporters of the Major Facilitator Superfamily	NA|52aa|up_3|NC_020528.1_1127283_1127439_-	NA	NA|59aa|up_2|NC_020528.1_1127476_1127653_-	NA	NA|287aa|up_1|NC_020528.1_1127791_1128652_-	cd10030, UDG-F4_TTUDGA_SPO1dp_like, Uracil DNA glycosylase family 4, includes Thermotoga maritima TTUDGA, Bacillus phage SPO1 DNA polymerase, and similar proteins	NA|430aa|up_0|NC_020528.1_1128724_1130014_-	cd10231, YegD_like, Escherichia coli YegD, a putative chaperone protein, and related proteins	NA|333aa|down_0|NC_020528.1_1132436_1133435_+	cd13641, PBP2_HisX_like, Substrate-binding domain of ABC-type histidine transporter involves in betaine and proline uptake; the type 2 periplasmic-binding protein fold	NA|296aa|down_1|NC_020528.1_1133655_1134543_+	COG4176, ProW, ABC-type proline/glycine betaine transport system, permease component [Amino acid transport and metabolism]	NA|338aa|down_2|NC_020528.1_1134632_1135646_-	TIGR02817, Putative_oxidoreductase_transmembrane_protein, zinc-binding alcohol dehydrogenase family protein	NA|134aa|down_3|NC_020528.1_1135746_1136148_+	COG1733, COG1733, Predicted transcriptional regulators [Transcription]	NA|309aa|down_4|NC_020528.1_1136286_1137213_+	COG1597, LCB5, Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase [Lipid metabolism / General function prediction only]	NA|496aa|down_5|NC_020528.1_1137228_1138716_-	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|340aa|down_6|NC_020528.1_1138708_1139728_-	cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to eukaryotic DPM1	NA|319aa|down_7|NC_020528.1_1139823_1140780_+	pfam03706, LPG_synthase_TM, Lysylphosphatidylglycerol synthase TM region	NA|501aa|down_8|NC_020528.1_1140808_1142311_-	PRK08292, PRK08292, AMP nucleosidase; Provisional	NA|107aa|down_9|NC_020528.1_1142390_1142711_-	COG1742, COG1742, Uncharacterized conserved protein [Function unknown]
GCF_000346065.1_ASM34606v1	NC_020528	Sinorhizobium meliloti 2011, complete sequence	3	2007852-2007964	3	CRISPRCasFinder	no		DEDDh,WYL,cas3,csa3	Orphan	CCCTCATTCCTGTGCTTGTCACAGGAAT	28	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,cas3,csa3,RT	NA,NA|313aa|down_2|NC_020528.1_2010615_2011554_+	NA|246aa|up_9|NC_020528.1_1995670_1996408_+	cd06170, LuxR_C_like, C-terminal DNA-binding domain of LuxR-like proteins	NA|215aa|up_8|NC_020528.1_1996564_1997209_+	COG3916, LasI, N-acyl-L-homoserine lactone synthetase [Signal transduction mechanisms / Secondary metabolites biosynthesis, transport, and catabolism]	NA|376aa|up_7|NC_020528.1_1997376_1998504_+	pfam11064, DUF2865, Protein of unknown function (DUF2865)	NA|228aa|up_6|NC_020528.1_1998527_1999211_-	cd02145, BluB, 5,6-dimethylbenzimidazole synthase	NA|274aa|up_5|NC_020528.1_1999508_2000330_-	PRK06198, PRK06198, short chain dehydrogenase; Provisional	NA|398aa|up_4|NC_020528.1_2000701_2001895_-	COG5285, COG5285, Protein involved in biosynthesis of mitomycin antibiotics/polyketide fumonisin [Secondary metabolites biosynthesis, transport, and catabolism]	NA|341aa|up_3|NC_020528.1_2001992_2003015_+	cd06307, PBP1_sugar_binding, periplasmic sugar-binding domain of uncharacterized transport systems	NA|276aa|up_2|NC_020528.1_2003025_2003853_-	COG0823, TolB, Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]	NA|561aa|up_1|NC_020528.1_2003855_2005538_-	PRK13981, PRK13981, NAD synthetase; Provisional	NA|640aa|up_0|NC_020528.1_2005806_2007726_+	COG1368, MdoB, Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily [Cell envelope biogenesis, outer membrane]	NA|141aa|down_0|NC_020528.1_2008352_2008775_+	COG4961, TadG, Flp pilus assembly protein TadG [Intracellular trafficking and secretion]	NA|578aa|down_1|NC_020528.1_2008776_2010510_+	COG4655, COG4655, Predicted membrane protein [Function unknown]	NA|313aa|down_2|NC_020528.1_2010615_2011554_+	NA	NA|132aa|down_3|NC_020528.1_2011575_2011971_-	COG3791, COG3791, Uncharacterized conserved protein [Function unknown]	NA|458aa|down_4|NC_020528.1_2012106_2013480_-	pfam01474, DAHP_synth_2, Class-II DAHP synthetase family	NA|464aa|down_5|NC_020528.1_2013712_2015104_-	TIGR01424, Glutathione_reductase_cytosolic, glutathione-disulfide reductase, plant	NA|181aa|down_6|NC_020528.1_2015308_2015851_-	COG3184, COG3184, Uncharacterized protein conserved in bacteria [Function unknown]	NA|232aa|down_7|NC_020528.1_2015962_2016658_-	PRK00702, PRK00702, ribose-5-phosphate isomerase RpiA	NA|228aa|down_8|NC_020528.1_2016877_2017561_+	PRK13222, PRK13222, N-acetylmuramic acid 6-phosphate phosphatase MupP	NA|632aa|down_9|NC_020528.1_2017593_2019489_-	COG2989, COG2989, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_000346065.1_ASM34606v1	NC_020560	Sinorhizobium meliloti 2011 plasmid pSymB, complete sequence	1	1085057-1085159	1	CRISPRCasFinder	no		RT,csa3	Orphan	TCGGCGGCGGCGGCGGTGCGGGCGGCA	27	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,cas3,csa3,RT	NA|133aa|up_2|NC_020560.1_1078424_1078823_+,NA|196aa|down_2|NC_020560.1_1090741_1091329_+,NA|198aa|down_4|NC_020560.1_1092155_1092749_-	NA|399aa|up_9|NC_020560.1_1065782_1066979_+	PRK13479, PRK13479, 2-aminoethylphosphonate--pyruvate transaminase; Provisional	NA|425aa|up_8|NC_020560.1_1067006_1068281_+	TIGR02335, phosphonoacetate_hydrolase, phosphonoacetate hydrolase	NA|486aa|up_7|NC_020560.1_1068277_1069735_+	TIGR03250, PhnAcAld_DH, putative phosphonoacetaldehyde dehydrogenase	NA|341aa|up_6|NC_020560.1_1070452_1071475_+	TIGR03261, phnS2, putative 2-aminoethylphosphonate ABC transporter, periplasmic 2-aminoethylphosphonate-binding protein	NA|388aa|up_5|NC_020560.1_1071564_1072728_+	TIGR03265, PhnT2, putative 2-aminoethylphosphonate ABC transporter, ATP-binding protein	NA|681aa|up_4|NC_020560.1_1072733_1074776_+	TIGR03262, PhnU2, putative 2-aminoethylphosphonate ABC transporter, permease protein	NA|1113aa|up_3|NC_020560.1_1074960_1078299_+	COG2931, COG2931, RTX toxins and related Ca2+-binding proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|133aa|up_2|NC_020560.1_1078424_1078823_+	NA	NA|261aa|up_1|NC_020560.1_1078911_1079694_-	COG3279, LytT, Response regulator of the LytR/AlgR family [Transcription / Signal transduction mechanisms]	NA|384aa|up_0|NC_020560.1_1079681_1080833_-	COG3275, LytS, Putative regulator of cell autolysis [Signal transduction mechanisms]	NA|304aa|down_0|NC_020560.1_1088144_1089056_+	COG2897, SseA, Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism]	NA|405aa|down_1|NC_020560.1_1089163_1090378_+	pfam04143, Sulf_transp, Sulphur transport	NA|196aa|down_2|NC_020560.1_1090741_1091329_+	NA	NA|168aa|down_3|NC_020560.1_1091497_1092001_+	pfam13523, Acetyltransf_8, Acetyltransferase (GNAT) domain	NA|198aa|down_4|NC_020560.1_1092155_1092749_-	NA	NA|594aa|down_5|NC_020560.1_1093245_1095027_+	PRK03659, PRK03659, glutathione-regulated potassium-efflux system protein KefB; Provisional	NA|742aa|down_6|NC_020560.1_1095107_1097333_-	COG1529, CoxL, Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs [Energy production and conversion]	NA|326aa|down_7|NC_020560.1_1097329_1098307_-	COG1319, CoxM, Aerobic-type carbon monoxide dehydrogenase, middle subunit CoxM/CutM homologs [Energy production and conversion]	NA|208aa|down_8|NC_020560.1_1098303_1098927_-	COG2080, CoxS, Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs [Energy production and conversion]	NA|297aa|down_9|NC_020560.1_1099126_1100017_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]
GCF_000346065.1_ASM34606v1	NC_020560	Sinorhizobium meliloti 2011 plasmid pSymB, complete sequence	2	1085252-1085333	2	CRISPRCasFinder	no		RT,csa3	Orphan	TCGGCGGCGGCGGCGGTGCGGGCGGCA	27	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,cas3,csa3,RT	NA|133aa|up_2|NC_020560.1_1078424_1078823_+,NA|196aa|down_2|NC_020560.1_1090741_1091329_+,NA|198aa|down_4|NC_020560.1_1092155_1092749_-	NA|399aa|up_9|NC_020560.1_1065782_1066979_+	PRK13479, PRK13479, 2-aminoethylphosphonate--pyruvate transaminase; Provisional	NA|425aa|up_8|NC_020560.1_1067006_1068281_+	TIGR02335, phosphonoacetate_hydrolase, phosphonoacetate hydrolase	NA|486aa|up_7|NC_020560.1_1068277_1069735_+	TIGR03250, PhnAcAld_DH, putative phosphonoacetaldehyde dehydrogenase	NA|341aa|up_6|NC_020560.1_1070452_1071475_+	TIGR03261, phnS2, putative 2-aminoethylphosphonate ABC transporter, periplasmic 2-aminoethylphosphonate-binding protein	NA|388aa|up_5|NC_020560.1_1071564_1072728_+	TIGR03265, PhnT2, putative 2-aminoethylphosphonate ABC transporter, ATP-binding protein	NA|681aa|up_4|NC_020560.1_1072733_1074776_+	TIGR03262, PhnU2, putative 2-aminoethylphosphonate ABC transporter, permease protein	NA|1113aa|up_3|NC_020560.1_1074960_1078299_+	COG2931, COG2931, RTX toxins and related Ca2+-binding proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|133aa|up_2|NC_020560.1_1078424_1078823_+	NA	NA|261aa|up_1|NC_020560.1_1078911_1079694_-	COG3279, LytT, Response regulator of the LytR/AlgR family [Transcription / Signal transduction mechanisms]	NA|384aa|up_0|NC_020560.1_1079681_1080833_-	COG3275, LytS, Putative regulator of cell autolysis [Signal transduction mechanisms]	NA|304aa|down_0|NC_020560.1_1088144_1089056_+	COG2897, SseA, Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism]	NA|405aa|down_1|NC_020560.1_1089163_1090378_+	pfam04143, Sulf_transp, Sulphur transport	NA|196aa|down_2|NC_020560.1_1090741_1091329_+	NA	NA|168aa|down_3|NC_020560.1_1091497_1092001_+	pfam13523, Acetyltransf_8, Acetyltransferase (GNAT) domain	NA|198aa|down_4|NC_020560.1_1092155_1092749_-	NA	NA|594aa|down_5|NC_020560.1_1093245_1095027_+	PRK03659, PRK03659, glutathione-regulated potassium-efflux system protein KefB; Provisional	NA|742aa|down_6|NC_020560.1_1095107_1097333_-	COG1529, CoxL, Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs [Energy production and conversion]	NA|326aa|down_7|NC_020560.1_1097329_1098307_-	COG1319, CoxM, Aerobic-type carbon monoxide dehydrogenase, middle subunit CoxM/CutM homologs [Energy production and conversion]	NA|208aa|down_8|NC_020560.1_1098303_1098927_-	COG2080, CoxS, Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs [Energy production and conversion]	NA|297aa|down_9|NC_020560.1_1099126_1100017_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]
GCF_000346065.1_ASM34606v1	NC_020560	Sinorhizobium meliloti 2011 plasmid pSymB, complete sequence	3	1085432-1085536	3	CRISPRCasFinder	no		RT,csa3	Orphan	TCGGCGGCGGCGGCGGTGCGGGCGGCA	27	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,cas3,csa3,RT	NA|133aa|up_2|NC_020560.1_1078424_1078823_+,NA|196aa|down_2|NC_020560.1_1090741_1091329_+,NA|198aa|down_4|NC_020560.1_1092155_1092749_-	NA|399aa|up_9|NC_020560.1_1065782_1066979_+	PRK13479, PRK13479, 2-aminoethylphosphonate--pyruvate transaminase; Provisional	NA|425aa|up_8|NC_020560.1_1067006_1068281_+	TIGR02335, phosphonoacetate_hydrolase, phosphonoacetate hydrolase	NA|486aa|up_7|NC_020560.1_1068277_1069735_+	TIGR03250, PhnAcAld_DH, putative phosphonoacetaldehyde dehydrogenase	NA|341aa|up_6|NC_020560.1_1070452_1071475_+	TIGR03261, phnS2, putative 2-aminoethylphosphonate ABC transporter, periplasmic 2-aminoethylphosphonate-binding protein	NA|388aa|up_5|NC_020560.1_1071564_1072728_+	TIGR03265, PhnT2, putative 2-aminoethylphosphonate ABC transporter, ATP-binding protein	NA|681aa|up_4|NC_020560.1_1072733_1074776_+	TIGR03262, PhnU2, putative 2-aminoethylphosphonate ABC transporter, permease protein	NA|1113aa|up_3|NC_020560.1_1074960_1078299_+	COG2931, COG2931, RTX toxins and related Ca2+-binding proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|133aa|up_2|NC_020560.1_1078424_1078823_+	NA	NA|261aa|up_1|NC_020560.1_1078911_1079694_-	COG3279, LytT, Response regulator of the LytR/AlgR family [Transcription / Signal transduction mechanisms]	NA|384aa|up_0|NC_020560.1_1079681_1080833_-	COG3275, LytS, Putative regulator of cell autolysis [Signal transduction mechanisms]	NA|304aa|down_0|NC_020560.1_1088144_1089056_+	COG2897, SseA, Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism]	NA|405aa|down_1|NC_020560.1_1089163_1090378_+	pfam04143, Sulf_transp, Sulphur transport	NA|196aa|down_2|NC_020560.1_1090741_1091329_+	NA	NA|168aa|down_3|NC_020560.1_1091497_1092001_+	pfam13523, Acetyltransf_8, Acetyltransferase (GNAT) domain	NA|198aa|down_4|NC_020560.1_1092155_1092749_-	NA	NA|594aa|down_5|NC_020560.1_1093245_1095027_+	PRK03659, PRK03659, glutathione-regulated potassium-efflux system protein KefB; Provisional	NA|742aa|down_6|NC_020560.1_1095107_1097333_-	COG1529, CoxL, Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs [Energy production and conversion]	NA|326aa|down_7|NC_020560.1_1097329_1098307_-	COG1319, CoxM, Aerobic-type carbon monoxide dehydrogenase, middle subunit CoxM/CutM homologs [Energy production and conversion]	NA|208aa|down_8|NC_020560.1_1098303_1098927_-	COG2080, CoxS, Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs [Energy production and conversion]	NA|297aa|down_9|NC_020560.1_1099126_1100017_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]
