assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000006965.1_ASM696v1	NC_003047	Sinorhizobium meliloti 1021, complete genome	1	174959-175105	1	CRISPRCasFinder	no		DEDDh,WYL,cas3,csa3	Orphan	TCTCCCCGCATGCGGGGAGAAGGG	24	0	0	NA	NA	NA	2	2	Orphan	DEDDh,WYL,cas3,csa3,RT	NA,NA	NA|227aa|up_9|NC_003047.1_164042_164723_+	COG5461, COG5461, Type IV pili component [Cell motility and secretion]	NA|429aa|up_8|NC_003047.1_164745_166032_+	COG4963, CpaE, Flp pilus assembly protein, ATPase CpaE [Intracellular trafficking and secretion]	NA|490aa|up_7|NC_003047.1_166093_167563_+	COG4962, CpaF, Flp pilus assembly protein, ATPase CpaF [Intracellular trafficking and secretion]	NA|337aa|up_6|NC_003047.1_167621_168632_+	COG4965, TadB, Flp pilus assembly protein TadB [Intracellular trafficking and secretion]	NA|328aa|up_5|NC_003047.1_168639_169623_+	COG2064, TadC, Flp pilus assembly protein TadC [Cell motility and secretion / Intracellular trafficking and secretion]	NA|388aa|up_4|NC_003047.1_169851_171015_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|270aa|up_3|NC_003047.1_171272_172082_-	COG5010, TadD, Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion]	NA|464aa|up_2|NC_003047.1_172212_173604_+	COG0260, PepB, Leucyl aminopeptidase [Amino acid transport and metabolism]	NA|115aa|up_1|NC_003047.1_173691_174036_+	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|285aa|up_0|NC_003047.1_174044_174899_+	COG0791, Spr, Cell wall-associated hydrolases (invasion-associated proteins) [Cell envelope biogenesis, outer membrane]	NA|320aa|down_0|NC_003047.1_175135_176095_-	cd12164, GDH_like_2, Putative glycerate dehydrogenase and related proteins of the D-specific 2-hydroxy dehydrogenase family	NA|543aa|down_1|NC_003047.1_176105_177734_-	COG4172, COG4172, ABC-type uncharacterized transport system, duplicated ATPase component [General function prediction only]	NA|378aa|down_2|NC_003047.1_177742_178876_-	COG4239, COG4239, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|361aa|down_3|NC_003047.1_178875_179958_-	COG4174, COG4174, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|620aa|down_4|NC_003047.1_180138_181998_-	cd08497, PBP2_NikA_DppA_OppA_like_14, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|356aa|down_5|NC_003047.1_182289_183357_+	COG3770, MepA, Murein endopeptidase [Cell envelope biogenesis, outer membrane]	NA|127aa|down_6|NC_003047.1_183477_183858_+	COG1803, MgsA, Methylglyoxal synthase [Carbohydrate transport and metabolism]	NA|340aa|down_7|NC_003047.1_183919_184939_+	PRK00292, glk, glucokinase; Provisional	NA|602aa|down_8|NC_003047.1_185047_186853_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|273aa|down_9|NC_003047.1_186870_187689_+	COG0289, DapB, Dihydrodipicolinate reductase [Amino acid transport and metabolism]
GCF_000006965.1_ASM696v1	NC_003047	Sinorhizobium meliloti 1021, complete genome	2	1131916-1132026	2	CRISPRCasFinder	no		DEDDh,WYL,cas3,csa3	Orphan	GGGCGAAGGGCCGGTCTATCCGAACATGTGAATGT	35	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,cas3,csa3,RT	NA|52aa|up_3|NC_003047.1_1127281_1127437_-,NA|59aa|up_2|NC_003047.1_1127474_1127651_-,NA	NA|466aa|up_9|NC_003047.1_1114615_1116013_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|987aa|up_8|NC_003047.1_1116047_1119008_+	PRK14108, PRK14108, bifunctional [glutamine synthetase] adenylyltransferase/[glutamine synthetase]-adenylyl-L-tyrosine phosphorylase	NA|776aa|up_7|NC_003047.1_1119018_1121346_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|885aa|up_6|NC_003047.1_1121699_1124354_-	PRK14015, pepN, aminopeptidase N; Provisional	NA|313aa|up_5|NC_003047.1_1124779_1125718_+	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|430aa|up_4|NC_003047.1_1125727_1127017_-	cd17477, MFS_YcaD_like, YcaD and similar transporters of the Major Facilitator Superfamily	NA|52aa|up_3|NC_003047.1_1127281_1127437_-	NA	NA|59aa|up_2|NC_003047.1_1127474_1127651_-	NA	NA|287aa|up_1|NC_003047.1_1127789_1128650_-	cd10030, UDG-F4_TTUDGA_SPO1dp_like, Uracil DNA glycosylase family 4, includes Thermotoga maritima TTUDGA, Bacillus phage SPO1 DNA polymerase, and similar proteins	NA|430aa|up_0|NC_003047.1_1128722_1130012_-	cd10231, YegD_like, Escherichia coli YegD, a putative chaperone protein, and related proteins	NA|333aa|down_0|NC_003047.1_1132434_1133433_+	cd13641, PBP2_HisX_like, Substrate-binding domain of ABC-type histidine transporter involves in betaine and proline uptake; the type 2 periplasmic-binding protein fold	NA|296aa|down_1|NC_003047.1_1133653_1134541_+	COG4176, ProW, ABC-type proline/glycine betaine transport system, permease component [Amino acid transport and metabolism]	NA|338aa|down_2|NC_003047.1_1134630_1135644_-	TIGR02817, Putative_oxidoreductase_transmembrane_protein, zinc-binding alcohol dehydrogenase family protein	NA|134aa|down_3|NC_003047.1_1135744_1136146_+	COG1733, COG1733, Predicted transcriptional regulators [Transcription]	NA|309aa|down_4|NC_003047.1_1136284_1137211_+	COG1597, LCB5, Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase [Lipid metabolism / General function prediction only]	NA|496aa|down_5|NC_003047.1_1137226_1138714_-	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|340aa|down_6|NC_003047.1_1138706_1139726_-	cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to eukaryotic DPM1	NA|319aa|down_7|NC_003047.1_1139821_1140778_+	pfam03706, LPG_synthase_TM, Lysylphosphatidylglycerol synthase TM region	NA|501aa|down_8|NC_003047.1_1140806_1142309_-	PRK08292, PRK08292, AMP nucleosidase; Provisional	NA|107aa|down_9|NC_003047.1_1142388_1142709_-	COG1742, COG1742, Uncharacterized conserved protein [Function unknown]
GCF_000006965.1_ASM696v1	NC_003047	Sinorhizobium meliloti 1021, complete genome	3	2008150-2008262	3	CRISPRCasFinder	no		DEDDh,WYL,cas3,csa3	Orphan	CCCTCATTCCTGTGCTTGTCACAGGAAT	28	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,cas3,csa3,RT	NA,NA|313aa|down_2|NC_003047.1_2010913_2011852_+	NA|246aa|up_9|NC_003047.1_1995968_1996706_+	cd06170, LuxR_C_like, C-terminal DNA-binding domain of LuxR-like proteins	NA|215aa|up_8|NC_003047.1_1996862_1997507_+	COG3916, LasI, N-acyl-L-homoserine lactone synthetase [Signal transduction mechanisms / Secondary metabolites biosynthesis, transport, and catabolism]	NA|376aa|up_7|NC_003047.1_1997674_1998802_+	pfam11064, DUF2865, Protein of unknown function (DUF2865)	NA|228aa|up_6|NC_003047.1_1998825_1999509_-	cd02145, BluB, 5,6-dimethylbenzimidazole synthase	NA|274aa|up_5|NC_003047.1_1999806_2000628_-	PRK06198, PRK06198, short chain dehydrogenase; Provisional	NA|398aa|up_4|NC_003047.1_2000999_2002193_-	COG5285, COG5285, Protein involved in biosynthesis of mitomycin antibiotics/polyketide fumonisin [Secondary metabolites biosynthesis, transport, and catabolism]	NA|341aa|up_3|NC_003047.1_2002290_2003313_+	cd06307, PBP1_sugar_binding, periplasmic sugar-binding domain of uncharacterized transport systems	NA|276aa|up_2|NC_003047.1_2003323_2004151_-	COG0823, TolB, Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]	NA|561aa|up_1|NC_003047.1_2004153_2005836_-	PRK13981, PRK13981, NAD synthetase; Provisional	NA|640aa|up_0|NC_003047.1_2006104_2008024_+	COG1368, MdoB, Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily [Cell envelope biogenesis, outer membrane]	NA|141aa|down_0|NC_003047.1_2008650_2009073_+	COG4961, TadG, Flp pilus assembly protein TadG [Intracellular trafficking and secretion]	NA|578aa|down_1|NC_003047.1_2009074_2010808_+	COG4655, COG4655, Predicted membrane protein [Function unknown]	NA|313aa|down_2|NC_003047.1_2010913_2011852_+	NA	NA|132aa|down_3|NC_003047.1_2011873_2012269_-	COG3791, COG3791, Uncharacterized conserved protein [Function unknown]	NA|458aa|down_4|NC_003047.1_2012404_2013778_-	pfam01474, DAHP_synth_2, Class-II DAHP synthetase family	NA|464aa|down_5|NC_003047.1_2014010_2015402_-	TIGR01424, Glutathione_reductase_cytosolic, glutathione-disulfide reductase, plant	NA|181aa|down_6|NC_003047.1_2015606_2016149_-	COG3184, COG3184, Uncharacterized protein conserved in bacteria [Function unknown]	NA|232aa|down_7|NC_003047.1_2016260_2016956_-	PRK00702, PRK00702, ribose-5-phosphate isomerase RpiA	NA|228aa|down_8|NC_003047.1_2017175_2017859_+	PRK13222, PRK13222, N-acetylmuramic acid 6-phosphate phosphatase MupP	NA|632aa|down_9|NC_003047.1_2017891_2019787_-	COG2989, COG2989, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_000006965.1_ASM696v1	NC_003078	Sinorhizobium meliloti 1021 plasmid pSymB, complete sequence	1	1085052-1085154	1	CRISPRCasFinder	no		RT,csa3	Orphan	TCGGCGGCGGCGGCGGTGCGGGCGGCA	27	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,cas3,csa3,RT	NA|133aa|up_2|NC_003078.1_1078419_1078818_+,NA|196aa|down_2|NC_003078.1_1090736_1091324_+,NA|198aa|down_4|NC_003078.1_1092150_1092744_-	NA|399aa|up_9|NC_003078.1_1065779_1066976_+	PRK13479, PRK13479, 2-aminoethylphosphonate--pyruvate transaminase; Provisional	NA|425aa|up_8|NC_003078.1_1067003_1068278_+	TIGR02335, phosphonoacetate_hydrolase, phosphonoacetate hydrolase	NA|486aa|up_7|NC_003078.1_1068274_1069732_+	TIGR03250, PhnAcAld_DH, putative phosphonoacetaldehyde dehydrogenase	NA|341aa|up_6|NC_003078.1_1070449_1071472_+	TIGR03261, phnS2, putative 2-aminoethylphosphonate ABC transporter, periplasmic 2-aminoethylphosphonate-binding protein	NA|396aa|up_5|NC_003078.1_1071536_1072724_+	TIGR03265, PhnT2, putative 2-aminoethylphosphonate ABC transporter, ATP-binding protein	NA|681aa|up_4|NC_003078.1_1072729_1074772_+	TIGR03262, PhnU2, putative 2-aminoethylphosphonate ABC transporter, permease protein	NA|1113aa|up_3|NC_003078.1_1074956_1078295_+	COG2931, COG2931, RTX toxins and related Ca2+-binding proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|133aa|up_2|NC_003078.1_1078419_1078818_+	NA	NA|261aa|up_1|NC_003078.1_1078906_1079689_-	COG3279, LytT, Response regulator of the LytR/AlgR family [Transcription / Signal transduction mechanisms]	NA|384aa|up_0|NC_003078.1_1079676_1080828_-	COG3275, LytS, Putative regulator of cell autolysis [Signal transduction mechanisms]	NA|304aa|down_0|NC_003078.1_1088139_1089051_+	COG2897, SseA, Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism]	NA|405aa|down_1|NC_003078.1_1089158_1090373_+	pfam04143, Sulf_transp, Sulphur transport	NA|196aa|down_2|NC_003078.1_1090736_1091324_+	NA	NA|168aa|down_3|NC_003078.1_1091492_1091996_+	pfam13523, Acetyltransf_8, Acetyltransferase (GNAT) domain	NA|198aa|down_4|NC_003078.1_1092150_1092744_-	NA	NA|594aa|down_5|NC_003078.1_1093240_1095022_+	PRK03659, PRK03659, glutathione-regulated potassium-efflux system protein KefB; Provisional	NA|742aa|down_6|NC_003078.1_1095102_1097328_-	COG1529, CoxL, Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs [Energy production and conversion]	NA|326aa|down_7|NC_003078.1_1097324_1098302_-	COG1319, CoxM, Aerobic-type carbon monoxide dehydrogenase, middle subunit CoxM/CutM homologs [Energy production and conversion]	NA|208aa|down_8|NC_003078.1_1098298_1098922_-	COG2080, CoxS, Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs [Energy production and conversion]	NA|297aa|down_9|NC_003078.1_1099120_1100011_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]
GCF_000006965.1_ASM696v1	NC_003078	Sinorhizobium meliloti 1021 plasmid pSymB, complete sequence	2	1085247-1085328	2	CRISPRCasFinder	no		RT,csa3	Orphan	TCGGCGGCGGCGGCGGTGCGGGCGGCA	27	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,cas3,csa3,RT	NA|133aa|up_2|NC_003078.1_1078419_1078818_+,NA|196aa|down_2|NC_003078.1_1090736_1091324_+,NA|198aa|down_4|NC_003078.1_1092150_1092744_-	NA|399aa|up_9|NC_003078.1_1065779_1066976_+	PRK13479, PRK13479, 2-aminoethylphosphonate--pyruvate transaminase; Provisional	NA|425aa|up_8|NC_003078.1_1067003_1068278_+	TIGR02335, phosphonoacetate_hydrolase, phosphonoacetate hydrolase	NA|486aa|up_7|NC_003078.1_1068274_1069732_+	TIGR03250, PhnAcAld_DH, putative phosphonoacetaldehyde dehydrogenase	NA|341aa|up_6|NC_003078.1_1070449_1071472_+	TIGR03261, phnS2, putative 2-aminoethylphosphonate ABC transporter, periplasmic 2-aminoethylphosphonate-binding protein	NA|396aa|up_5|NC_003078.1_1071536_1072724_+	TIGR03265, PhnT2, putative 2-aminoethylphosphonate ABC transporter, ATP-binding protein	NA|681aa|up_4|NC_003078.1_1072729_1074772_+	TIGR03262, PhnU2, putative 2-aminoethylphosphonate ABC transporter, permease protein	NA|1113aa|up_3|NC_003078.1_1074956_1078295_+	COG2931, COG2931, RTX toxins and related Ca2+-binding proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|133aa|up_2|NC_003078.1_1078419_1078818_+	NA	NA|261aa|up_1|NC_003078.1_1078906_1079689_-	COG3279, LytT, Response regulator of the LytR/AlgR family [Transcription / Signal transduction mechanisms]	NA|384aa|up_0|NC_003078.1_1079676_1080828_-	COG3275, LytS, Putative regulator of cell autolysis [Signal transduction mechanisms]	NA|304aa|down_0|NC_003078.1_1088139_1089051_+	COG2897, SseA, Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism]	NA|405aa|down_1|NC_003078.1_1089158_1090373_+	pfam04143, Sulf_transp, Sulphur transport	NA|196aa|down_2|NC_003078.1_1090736_1091324_+	NA	NA|168aa|down_3|NC_003078.1_1091492_1091996_+	pfam13523, Acetyltransf_8, Acetyltransferase (GNAT) domain	NA|198aa|down_4|NC_003078.1_1092150_1092744_-	NA	NA|594aa|down_5|NC_003078.1_1093240_1095022_+	PRK03659, PRK03659, glutathione-regulated potassium-efflux system protein KefB; Provisional	NA|742aa|down_6|NC_003078.1_1095102_1097328_-	COG1529, CoxL, Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs [Energy production and conversion]	NA|326aa|down_7|NC_003078.1_1097324_1098302_-	COG1319, CoxM, Aerobic-type carbon monoxide dehydrogenase, middle subunit CoxM/CutM homologs [Energy production and conversion]	NA|208aa|down_8|NC_003078.1_1098298_1098922_-	COG2080, CoxS, Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs [Energy production and conversion]	NA|297aa|down_9|NC_003078.1_1099120_1100011_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]
GCF_000006965.1_ASM696v1	NC_003078	Sinorhizobium meliloti 1021 plasmid pSymB, complete sequence	3	1085427-1085531	3	CRISPRCasFinder	no		RT,csa3	Orphan	TCGGCGGCGGCGGCGGTGCGGGCGGCA	27	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,cas3,csa3,RT	NA|133aa|up_2|NC_003078.1_1078419_1078818_+,NA|196aa|down_2|NC_003078.1_1090736_1091324_+,NA|198aa|down_4|NC_003078.1_1092150_1092744_-	NA|399aa|up_9|NC_003078.1_1065779_1066976_+	PRK13479, PRK13479, 2-aminoethylphosphonate--pyruvate transaminase; Provisional	NA|425aa|up_8|NC_003078.1_1067003_1068278_+	TIGR02335, phosphonoacetate_hydrolase, phosphonoacetate hydrolase	NA|486aa|up_7|NC_003078.1_1068274_1069732_+	TIGR03250, PhnAcAld_DH, putative phosphonoacetaldehyde dehydrogenase	NA|341aa|up_6|NC_003078.1_1070449_1071472_+	TIGR03261, phnS2, putative 2-aminoethylphosphonate ABC transporter, periplasmic 2-aminoethylphosphonate-binding protein	NA|396aa|up_5|NC_003078.1_1071536_1072724_+	TIGR03265, PhnT2, putative 2-aminoethylphosphonate ABC transporter, ATP-binding protein	NA|681aa|up_4|NC_003078.1_1072729_1074772_+	TIGR03262, PhnU2, putative 2-aminoethylphosphonate ABC transporter, permease protein	NA|1113aa|up_3|NC_003078.1_1074956_1078295_+	COG2931, COG2931, RTX toxins and related Ca2+-binding proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|133aa|up_2|NC_003078.1_1078419_1078818_+	NA	NA|261aa|up_1|NC_003078.1_1078906_1079689_-	COG3279, LytT, Response regulator of the LytR/AlgR family [Transcription / Signal transduction mechanisms]	NA|384aa|up_0|NC_003078.1_1079676_1080828_-	COG3275, LytS, Putative regulator of cell autolysis [Signal transduction mechanisms]	NA|304aa|down_0|NC_003078.1_1088139_1089051_+	COG2897, SseA, Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism]	NA|405aa|down_1|NC_003078.1_1089158_1090373_+	pfam04143, Sulf_transp, Sulphur transport	NA|196aa|down_2|NC_003078.1_1090736_1091324_+	NA	NA|168aa|down_3|NC_003078.1_1091492_1091996_+	pfam13523, Acetyltransf_8, Acetyltransferase (GNAT) domain	NA|198aa|down_4|NC_003078.1_1092150_1092744_-	NA	NA|594aa|down_5|NC_003078.1_1093240_1095022_+	PRK03659, PRK03659, glutathione-regulated potassium-efflux system protein KefB; Provisional	NA|742aa|down_6|NC_003078.1_1095102_1097328_-	COG1529, CoxL, Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs [Energy production and conversion]	NA|326aa|down_7|NC_003078.1_1097324_1098302_-	COG1319, CoxM, Aerobic-type carbon monoxide dehydrogenase, middle subunit CoxM/CutM homologs [Energy production and conversion]	NA|208aa|down_8|NC_003078.1_1098298_1098922_-	COG2080, CoxS, Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs [Energy production and conversion]	NA|297aa|down_9|NC_003078.1_1099120_1100011_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]
