assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000176035.2_ASM17603v2	NC_015675	Mesorhizobium opportunistum WSM2075, complete sequence	1	3129517-3129966	1	CRT	no		WYL,csa3,DEDDh,cas3	Orphan	TTGCCCTGGGCCTGCTNN	18	5	5	3129535-3129552|3129607-3129624|3129679-3129696|3129751-3129768|3129823-3129840	NC_015675.1_4798333-4798350|NC_015675.1_5745738-5745721|NC_015675.1_5745738-5745721|NC_015675.1_5745738-5745721|NC_015675.1_5745738-5745721	NA	12	12	Orphan	WYL,csa3,DEDDh,cas3	NA|201aa|up_4|NC_015675.1_3124942_3125545_-,NA|73aa|down_0|NC_015675.1_3130284_3130503_+,NA|79aa|down_2|NC_015675.1_3131519_3131756_+,NA|62aa|down_4|NC_015675.1_3132242_3132428_+	NA|335aa|up_9|NC_015675.1_3119595_3120600_+	COG1376, ErfK, Uncharacterized protein conserved in bacteria [Function unknown]	NA|356aa|up_8|NC_015675.1_3120814_3121882_+	cd00831, CHS_like, Chalcone and stilbene synthases; plant-specific polyketide synthases (PKS) and related enzymes, also called type III PKSs	NA|242aa|up_7|NC_015675.1_3121878_3122604_+	PRK06202, PRK06202, hypothetical protein; Provisional	NA|414aa|up_6|NC_015675.1_3122600_3123842_+	COG0654, UbiH, 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases [Coenzyme metabolism / Energy production and conversion]	NA|307aa|up_5|NC_015675.1_3123988_3124909_+	cd00377, ICL_PEPM, Members of the ICL/PEPM enzyme family catalyze either P-C or C-C bond formation/cleavage	NA|201aa|up_4|NC_015675.1_3124942_3125545_-	NA	NA|204aa|up_3|NC_015675.1_3125833_3126445_+	COG3145, AlkB, Alkylated DNA repair protein [DNA replication, recombination, and repair]	NA|147aa|up_2|NC_015675.1_3126582_3127023_+	COG4244, COG4244, Predicted membrane protein [Function unknown]	NA|450aa|up_1|NC_015675.1_3127019_3128369_+	COG2133, COG2133, Glucose/sorbosone dehydrogenases [Carbohydrate transport and metabolism]	NA|206aa|up_0|NC_015675.1_3128428_3129046_+	pfam03358, FMN_red, NADPH-dependent FMN reductase	NA|73aa|down_0|NC_015675.1_3130284_3130503_+	NA	NA|240aa|down_1|NC_015675.1_3130609_3131329_-	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|79aa|down_2|NC_015675.1_3131519_3131756_+	NA	NA|65aa|down_3|NC_015675.1_3131968_3132163_+	pfam11154, DUF2934, Protein of unknown function (DUF2934)	NA|62aa|down_4|NC_015675.1_3132242_3132428_+	NA	NA|299aa|down_5|NC_015675.1_3132424_3133321_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|143aa|down_6|NC_015675.1_3133462_3133891_+	COG2343, COG2343, Uncharacterized protein conserved in bacteria [Function unknown]	NA|140aa|down_7|NC_015675.1_3133938_3134358_-	COG0317, SpoT, Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription]	NA|203aa|down_8|NC_015675.1_3134667_3135276_+	cd00431, cysteine_hydrolases, Cysteine hydrolases; This family contains amidohydrolases, like CSHase (N-carbamoylsarcosine amidohydrolase), involved in creatine metabolism and nicotinamidase, converting nicotinamide to nicotinic acid and ammonia in the pyridine nucleotide cycle	NA|148aa|down_9|NC_015675.1_3135291_3135735_-	COG1661, COG1661, Predicted DNA-binding protein with PD1-like DNA-binding motif [General function prediction only]
GCF_000176035.2_ASM17603v2	NC_015675	Mesorhizobium opportunistum WSM2075, complete sequence	2	3303732-3303820	1	CRISPRCasFinder	no		WYL,csa3,DEDDh,cas3	Orphan	CCCGCTCTTCAGGTATTCAGCCCCTTCAAA	30	0	0	NA	NA	NA	1	1	Orphan	WYL,csa3,DEDDh,cas3	NA|151aa|up_5|NC_015675.1_3298611_3299064_+,NA|96aa|up_4|NC_015675.1_3300107_3300395_-,NA|204aa|down_3|NC_015675.1_3306837_3307449_-,NA|104aa|down_7|NC_015675.1_3310867_3311179_-,NA|147aa|down_9|NC_015675.1_3313062_3313503_-	NA|228aa|up_9|NC_015675.1_3293348_3294032_+	COG3571, COG3571, Predicted hydrolase of the alpha/beta-hydrolase fold [General function prediction only]	NA|257aa|up_8|NC_015675.1_3294459_3295230_+	cd09086, ExoIII-like_AP-endo, Escherichia coli exonuclease III (ExoIII) and Neisseria meningitides NExo-like subfamily of the ExoIII family purinic/apyrimidinic (AP) endonucleases	NA|398aa|up_7|NC_015675.1_3295370_3296564_+	pfam17210, SdrD_B, SdrD B-like domain	NA|518aa|up_6|NC_015675.1_3296976_3298530_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|151aa|up_5|NC_015675.1_3298611_3299064_+	NA	NA|96aa|up_4|NC_015675.1_3300107_3300395_-	NA	NA|247aa|up_3|NC_015675.1_3300504_3301245_-	pfam17784, Sulfotransfer_4, Sulfotransferase domain	NA|82aa|up_2|NC_015675.1_3301462_3301708_+	pfam11089, SyrA, Exopolysaccharide production repressor	NA|101aa|up_1|NC_015675.1_3302304_3302607_+	pfam06823, DUF1236, Protein of unknown function (DUF1236)	NA|289aa|up_0|NC_015675.1_3302723_3303590_-	PRK11752, PRK11752, putative S-transferase; Provisional	NA|302aa|down_0|NC_015675.1_3304000_3304906_+	COG0265, DegQ, Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain [Posttranslational modification, protein turnover, chaperones]	NA|320aa|down_1|NC_015675.1_3304928_3305888_+	COG0583, LysR, Transcriptional regulator [Transcription]	NA|243aa|down_2|NC_015675.1_3305900_3306629_-	PRK07041, PRK07041, SDR family oxidoreductase	NA|204aa|down_3|NC_015675.1_3306837_3307449_-	NA	NA|115aa|down_4|NC_015675.1_3308266_3308611_-	cd06587, VOC, vicinal oxygen chelate (VOC) family	NA|200aa|down_5|NC_015675.1_3308807_3309407_+	COG4430, COG4430, Uncharacterized protein conserved in bacteria [Function unknown]	NA|364aa|down_6|NC_015675.1_3309692_3310784_+	cd13717, PBP2_iGluR_putative, The ligand-binding domain of putative ionotropic glutamate receptors, a member of the type 2 periplasmic binding fold protein superfamily	NA|104aa|down_7|NC_015675.1_3310867_3311179_-	NA	NA|541aa|down_8|NC_015675.1_3311435_3313058_+	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|147aa|down_9|NC_015675.1_3313062_3313503_-	NA
GCF_000176035.2_ASM17603v2	NC_015675	Mesorhizobium opportunistum WSM2075, complete sequence	3	3620464-3620550	2	CRISPRCasFinder	no	csa3	WYL,csa3,DEDDh,cas3	Type I-A	TGCCGTTGAACCCGGTTCAACGGCAACTC	29	0	0	NA	NA	NA	1	1	Orphan	WYL,csa3,DEDDh,cas3	NA,NA	NA|281aa|up_9|NC_015675.1_3610714_3611557_-	PRK08263, PRK08263, short chain dehydrogenase; Provisional	NA|321aa|up_8|NC_015675.1_3611690_3612653_-	cd01558, D-AAT_like, D-Alanine aminotransferase (D-AAT_like): D-amino acid aminotransferase catalyzes transamination between D-amino acids and their respective alpha-keto acids	NA|170aa|up_7|NC_015675.1_3612802_3613312_+	pfam05110, AF-4, AF-4 proto-oncoprotein	NA|105aa|up_6|NC_015675.1_3613318_3613633_-	COG5586, COG5586, Uncharacterized conserved protein [Function unknown]	NA|174aa|up_5|NC_015675.1_3613629_3614151_-	COG1720, COG1720, Uncharacterized conserved protein [Function unknown]	csa3|341aa|up_4|NC_015675.1_3614359_3615382_+	pfam13847, Methyltransf_31, Methyltransferase domain	NA|306aa|up_3|NC_015675.1_3615381_3616299_+	TIGR00676, 510-methylenetetrahydrofolate_reductase	NA|310aa|up_2|NC_015675.1_3616668_3617598_-	TIGR00950, Uncharacterized_inner_membrane_transporter_YicL, Carboxylate/Amino Acid/Amine Transporter	NA|407aa|up_1|NC_015675.1_3617821_3619042_+	cd00622, PLPDE_III_ODC, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Ornithine Decarboxylase	NA|393aa|up_0|NC_015675.1_3619118_3620297_-	COG2951, MltB, Membrane-bound lytic murein transglycosylase B [Cell envelope biogenesis, outer membrane]	NA|267aa|down_0|NC_015675.1_3620596_3621397_-	cd06413, GH25_muramidase_1, Uncharacterized bacterial muramidase containing a glycosyl hydrolase family 25 (GH25) catalytic domain	NA|460aa|down_1|NC_015675.1_3621655_3623035_+	COG1680, AmpC, Beta-lactamase class C and other penicillin binding proteins [Defense mechanisms]	NA|103aa|down_2|NC_015675.1_3623071_3623380_-	COG3093, VapI, Plasmid maintenance system antidote protein [General function prediction only]	NA|228aa|down_3|NC_015675.1_3623800_3624484_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|433aa|down_4|NC_015675.1_3624480_3625779_-	COG5379, BtaA, S-adenosylmethionine:diacylglycerol 3-amino-3-carboxypropyl transferase [Lipid metabolism]	NA|278aa|down_5|NC_015675.1_3625914_3626748_-	COG4870, COG4870, Cysteine protease [Posttranslational modification, protein turnover, chaperones]	NA|384aa|down_6|NC_015675.1_3626979_3628131_+	pfam01048, PNP_UDP_1, Phosphorylase superfamily	NA|1250aa|down_7|NC_015675.1_3628175_3631925_-	PLN02666, PLN02666, 5-oxoprolinase	NA|70aa|down_8|NC_015675.1_3632105_3632315_+	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|240aa|down_9|NC_015675.1_3632476_3633196_+	COG1076, DjlA, DnaJ-domain-containing proteins 1 [Posttranslational modification, protein turnover, chaperones]
GCF_000176035.2_ASM17603v2	NC_015675	Mesorhizobium opportunistum WSM2075, complete sequence	4	6806036-6806111	3	CRISPRCasFinder	no		WYL,csa3,DEDDh,cas3	Orphan	GCGATCGCGACGGCGACGCCGTC	23	0	0	NA	NA	NA	1	1	Orphan	WYL,csa3,DEDDh,cas3	NA|145aa|up_7|NC_015675.1_6799612_6800047_-,NA|88aa|up_2|NC_015675.1_6804017_6804281_-,NA	NA|1164aa|up_9|NC_015675.1_6795332_6798824_+	COG0591, PutP, Na+/proline symporter [Amino acid transport and metabolism / General function prediction only]	NA|220aa|up_8|NC_015675.1_6798825_6799485_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|145aa|up_7|NC_015675.1_6799612_6800047_-	NA	NA|117aa|up_6|NC_015675.1_6800210_6800561_+	COG3502, COG3502, Uncharacterized protein conserved in bacteria [Function unknown]	NA|360aa|up_5|NC_015675.1_6800557_6801637_+	cd04738, DHOD_2_like, Dihydroorotate dehydrogenase (DHOD) class 2	NA|447aa|up_4|NC_015675.1_6801629_6802970_-	cd13136, MATE_DinF_like, DinF and similar proteins, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins	NA|299aa|up_3|NC_015675.1_6803101_6803998_+	cd09993, HDAC_classIV, Histone deacetylase class IV also known as histone deacetylase 11	NA|88aa|up_2|NC_015675.1_6804017_6804281_-	NA	NA|330aa|up_1|NC_015675.1_6804549_6805539_+	cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to eukaryotic DPM1	NA|152aa|up_0|NC_015675.1_6805568_6806024_+	pfam04138, GtrA, GtrA-like protein	NA|254aa|down_0|NC_015675.1_6809926_6810688_-	COG4424, COG4424, Uncharacterized protein conserved in bacteria [Function unknown]	NA|330aa|down_1|NC_015675.1_6811403_6812393_-	cd08435, PBP2_GbpR, The C-terminal substrate binding domain of galactose-binding protein regulator contains the type 2 periplasmic binding fold	NA|357aa|down_2|NC_015675.1_6812681_6813752_+	COG4213, XylF, ABC-type xylose transport system, periplasmic component [Carbohydrate transport and metabolism]	NA|511aa|down_3|NC_015675.1_6813851_6815384_+	COG1129, MglA, ABC-type sugar transport system, ATPase component [Carbohydrate transport and metabolism]	NA|429aa|down_4|NC_015675.1_6815380_6816667_+	COG4214, XylH, ABC-type xylose transport system, permease component [Carbohydrate transport and metabolism]	NA|331aa|down_5|NC_015675.1_6816959_6817952_+	COG3802, GguC, Uncharacterized protein conserved in bacteria [Function unknown]	NA|258aa|down_6|NC_015675.1_6818147_6818921_-	cd13702, PBP2_mlr5654_like, Substrate binding domain of ABC-type histidine/lysine/arginine/ornithine transporter-like; the type 2 periplasmic-binding protein fold	NA|365aa|down_7|NC_015675.1_6819020_6820115_-	PRK07559, PRK07559, 2'-deoxycytidine 5'-triphosphate deaminase; Provisional	NA|397aa|down_8|NC_015675.1_6820336_6821527_+	PRK07504, PRK07504, O-succinylhomoserine sulfhydrylase; Reviewed	NA|384aa|down_9|NC_015675.1_6821563_6822715_-	cd02520, Glucosylceramide_synthase, Glucosylceramide synthase catalyzes the first glycosylation step of glycosphingolipid synthesis
