assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001664405.1_ASM166440v1	NZ_CP013568	Rhizobium phaseoli strain N771 chromosome, complete genome	1	1216745-1216854	1	CRISPRCasFinder	no	WYL	DEDDh,WYL,cas3,csa3	Unclear	GCGCAGCGGGAGCAATCGATCCAGTGAAT	29	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,cas3,csa3,PD-DExK,RT	NA|105aa|up_8|NZ_CP013568.1_1207527_1207842_+,NA|185aa|down_0|NZ_CP013568.1_1216956_1217511_-	NA|152aa|up_9|NZ_CP013568.1_1206940_1207396_+	cd04688, Nudix_Hydrolase_29, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|105aa|up_8|NZ_CP013568.1_1207527_1207842_+	NA	NA|506aa|up_7|NZ_CP013568.1_1207854_1209372_-	PRK12853, PRK12853, glucose-6-phosphate dehydrogenase	NA|257aa|up_6|NZ_CP013568.1_1209723_1210494_-	PRK01683, PRK01683, trans-aconitate 2-methyltransferase; Provisional	NA|425aa|up_5|NZ_CP013568.1_1210581_1211856_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|155aa|up_4|NZ_CP013568.1_1212256_1212721_-	pfam07883, Cupin_2, Cupin domain	NA|209aa|up_3|NZ_CP013568.1_1212792_1213419_-	cd01014, nicotinamidase_related, Nicotinamidase_ related amidohydrolases	NA|295aa|up_2|NZ_CP013568.1_1213496_1214381_+	COG0583, LysR, Transcriptional regulator [Transcription]	NA|347aa|up_1|NZ_CP013568.1_1214554_1215595_-	cd08260, Zn_ADH6, Alcohol dehydrogenases of the MDR family	NA|296aa|up_0|NZ_CP013568.1_1215845_1216733_+	PRK13356, PRK13356, branched-chain amino acid aminotransferase	NA|185aa|down_0|NZ_CP013568.1_1216956_1217511_-	NA	NA|300aa|down_1|NZ_CP013568.1_1217654_1218554_-	cd08422, PBP2_CrgA_like, The C-terminal substrate binding domain of LysR-type transcriptional regulator CrgA and its related homologs, contains the type 2 periplasmic binding domain	NA|414aa|down_2|NZ_CP013568.1_1218691_1219933_+	COG1566, EmrA, Multidrug resistance efflux pump [Defense mechanisms]	NA|540aa|down_3|NZ_CP013568.1_1220004_1221624_+	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|397aa|down_4|NZ_CP013568.1_1221650_1222841_+	cd07302, CHD, cyclase homology domain	NA|278aa|down_5|NZ_CP013568.1_1223004_1223838_+	PRK00768, nadE, ammonia-dependent NAD(+) synthetase	NA|228aa|down_6|NZ_CP013568.1_1223847_1224531_-	PRK10542, PRK10542, glutathionine S-transferase; Provisional	NA|206aa|down_7|NZ_CP013568.1_1224646_1225264_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|425aa|down_8|NZ_CP013568.1_1225225_1226500_-	cd17391, MFS_MdtG_MDR_like, Multidrug resistance protein MdtG and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|114aa|down_9|NZ_CP013568.1_1226613_1226955_+	pfam12680, SnoaL_2, SnoaL-like domain
GCF_001664405.1_ASM166440v1	NZ_CP013568	Rhizobium phaseoli strain N771 chromosome, complete genome	2	1467292-1467385	2	CRISPRCasFinder	no		DEDDh,WYL,cas3,csa3	Orphan	CAAGCTTGCTTGTCCGCCCGCGGCAAG	27	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,cas3,csa3,PD-DExK,RT	NA,NA|168aa|down_4|NZ_CP013568.1_1470522_1471026_-	NA|681aa|up_9|NZ_CP013568.1_1455541_1457584_-	COG1505, COG1505, Serine proteases of the peptidase family S9A [Amino acid transport and metabolism]	NA|435aa|up_8|NZ_CP013568.1_1457685_1458990_-	TIGR03860, FMN_nitrolo, FMN-dependent oxidoreductase, nitrilotriacetate monooxygenase family	NA|453aa|up_7|NZ_CP013568.1_1459004_1460363_-	pfam03972, MmgE_PrpD, MmgE/PrpD family	NA|262aa|up_6|NZ_CP013568.1_1460372_1461158_-	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|264aa|up_5|NZ_CP013568.1_1461162_1461954_-	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|249aa|up_4|NZ_CP013568.1_1461950_1462697_-	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|277aa|up_3|NZ_CP013568.1_1462721_1463552_-	cd13693, PBP2_polar_AA, Substrate binding domain of polar amino-acid uptake  ABC transporter; the type 2 periplasmic binding protein fold	NA|195aa|up_2|NZ_CP013568.1_1463659_1464244_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|282aa|up_1|NZ_CP013568.1_1464380_1465226_-	cd10917, CE4_NodB_like_6s_7s, Catalytic NodB homology domain of rhizobial NodB-like proteins	NA|628aa|up_0|NZ_CP013568.1_1465354_1467238_-	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|350aa|down_0|NZ_CP013568.1_1467409_1468459_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|170aa|down_1|NZ_CP013568.1_1468598_1469108_-	COG2318, DinB, Uncharacterized protein conserved in bacteria [Function unknown]	NA|230aa|down_2|NZ_CP013568.1_1469245_1469935_+	cd03202, GST_C_etherase_LigE, C-terminal, alpha helical domain of Beta etherase LigE	NA|141aa|down_3|NZ_CP013568.1_1470016_1470439_+	PRK00668, ndk, mulitfunctional nucleoside diphosphate kinase/apyrimidinic endonuclease/3'-; Validated	NA|168aa|down_4|NZ_CP013568.1_1470522_1471026_-	NA	NA|182aa|down_5|NZ_CP013568.1_1471022_1471568_-	COG5516, COG5516, Conserved protein containing a Zn-ribbon-like motif, possibly RNA-binding [General function prediction only]	NA|299aa|down_6|NZ_CP013568.1_1471633_1472530_+	COG0559, LivH, Branched-chain amino acid ABC-type transport system, permease components [Amino acid transport and metabolism]	NA|154aa|down_7|NZ_CP013568.1_1472580_1473042_-	COG0314, MoaE, Molybdopterin converting factor, large subunit [Coenzyme metabolism]	NA|85aa|down_8|NZ_CP013568.1_1473286_1473541_-	TIGR01682, Molybdopterin_synthase_sulfur_carrier_subunit, molybdopterin converting factor, subunit 1, non-archaeal	NA|197aa|down_9|NZ_CP013568.1_1473537_1474128_-	TIGR00560, pgsA, CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyltransferase
GCF_001664405.1_ASM166440v1	NZ_CP013568	Rhizobium phaseoli strain N771 chromosome, complete genome	3	1926725-1926813	3	CRISPRCasFinder	no		DEDDh,WYL,cas3,csa3	Orphan	GCCGCTCCGAAGAAGAAGGCTGC	23	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,cas3,csa3,PD-DExK,RT	NA|156aa|up_5|NZ_CP013568.1_1919738_1920206_+,NA|58aa|down_9|NZ_CP013568.1_1935960_1936134_+	NA|116aa|up_9|NZ_CP013568.1_1915484_1915832_+	COG1862, YajC, Preprotein translocase subunit YajC [Intracellular trafficking and secretion]	NA|847aa|up_8|NZ_CP013568.1_1915876_1918417_+	PRK14726, PRK14726, protein translocase subunit SecDF	NA|129aa|up_7|NZ_CP013568.1_1918421_1918808_+	COG3737, COG3737, Uncharacterized conserved protein [Function unknown]	NA|286aa|up_6|NZ_CP013568.1_1918810_1919668_+	COG1562, ERG9, Phytoene/squalene synthetase [Lipid metabolism]	NA|156aa|up_5|NZ_CP013568.1_1919738_1920206_+	NA	NA|477aa|up_4|NZ_CP013568.1_1920215_1921646_-	PRK05335, PRK05335, tRNA (uracil-5-)-methyltransferase Gid; Reviewed	NA|48aa|up_3|NZ_CP013568.1_1921744_1921888_-	COG5457, COG5457, Uncharacterized conserved small protein [Function unknown]	NA|49aa|up_2|NZ_CP013568.1_1922187_1922334_-	COG5457, COG5457, Uncharacterized conserved small protein [Function unknown]	NA|368aa|up_1|NZ_CP013568.1_1923231_1924335_+	COG4872, COG4872, Predicted membrane protein [Function unknown]	NA|197aa|up_0|NZ_CP013568.1_1924331_1924922_+	pfam14345, GDYXXLXY, GDYXXLXY protein	NA|141aa|down_0|NZ_CP013568.1_1927004_1927427_+	COG3631, COG3631, Ketosteroid isomerase-related protein [General function prediction only]	NA|592aa|down_1|NZ_CP013568.1_1927521_1929297_-	smart00729, Elp3, Elongator protein 3, MiaB family, Radical SAM	NA|268aa|down_2|NZ_CP013568.1_1929456_1930260_-	cd08372, EEP, Exonuclease-Endonuclease-Phosphatase (EEP) domain superfamily	NA|469aa|down_3|NZ_CP013568.1_1930319_1931726_-	PRK05249, PRK05249, Si-specific NAD(P)(+) transhydrogenase	NA|42aa|down_4|NZ_CP013568.1_1931824_1931950_-	PRK00024, PRK00024, DNA repair protein RadC	NA|396aa|down_5|NZ_CP013568.1_1931975_1933163_-	pfam01070, FMN_dh, FMN-dependent dehydrogenase	NA|275aa|down_6|NZ_CP013568.1_1933265_1934090_-	PRK00024, PRK00024, DNA repair protein RadC	NA|279aa|down_7|NZ_CP013568.1_1934097_1934934_-	PRK05716, PRK05716, methionine aminopeptidase; Validated	NA|220aa|down_8|NZ_CP013568.1_1935170_1935830_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|58aa|down_9|NZ_CP013568.1_1935960_1936134_+	NA
GCF_001664405.1_ASM166440v1	NZ_CP013569	Rhizobium phaseoli strain N771 plasmid pRphaN771a, complete sequence	1	51325-51427	1	CRISPRCasFinder	no			Orphan	CTTCCGCAAGGGTCCCGACTTCTCG	25	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,cas3,csa3,PD-DExK,RT	NA|237aa|up_7|NZ_CP013569.1_41812_42523_-,NA|97aa|up_6|NZ_CP013569.1_42770_43061_-,NA|223aa|up_5|NZ_CP013569.1_43264_43933_-,NA|264aa|up_4|NZ_CP013569.1_44385_45177_+,NA|57aa|up_2|NZ_CP013569.1_47948_48119_+,NA|238aa|up_1|NZ_CP013569.1_48217_48931_+,NA|423aa|up_0|NZ_CP013569.1_49071_50340_+,NA|101aa|down_1|NZ_CP013569.1_53870_54173_+,NA|92aa|down_2|NZ_CP013569.1_54181_54457_+,NA|98aa|down_3|NZ_CP013569.1_54467_54761_+,NA|68aa|down_4|NZ_CP013569.1_54757_54961_+,NA|230aa|down_5|NZ_CP013569.1_54975_55665_+,NA|66aa|down_6|NZ_CP013569.1_55674_55872_+,NA|171aa|down_9|NZ_CP013569.1_57168_57681_+	NA|388aa|up_9|NZ_CP013569.1_40032_41196_+	pfam14082, DUF4263, Domain of unknown function (DUF4263)	NA|192aa|up_8|NZ_CP013569.1_41208_41784_-	cd10436, GIY-YIG_EndoII_Hpy188I_like, Catalytic GIY-YIG domain of coliphage T4 non-specific endonuclease II, type II restriction endonuclease R	NA|237aa|up_7|NZ_CP013569.1_41812_42523_-	NA	NA|97aa|up_6|NZ_CP013569.1_42770_43061_-	NA	NA|223aa|up_5|NZ_CP013569.1_43264_43933_-	NA	NA|264aa|up_4|NZ_CP013569.1_44385_45177_+	NA	NA|833aa|up_3|NZ_CP013569.1_45182_47681_+	pfam10412, TrwB_AAD_bind, Type IV secretion-system coupling protein DNA-binding domain	NA|57aa|up_2|NZ_CP013569.1_47948_48119_+	NA	NA|238aa|up_1|NZ_CP013569.1_48217_48931_+	NA	NA|423aa|up_0|NZ_CP013569.1_49071_50340_+	NA	NA|480aa|down_0|NZ_CP013569.1_52081_53521_+	PTZ00144, PTZ00144, dihydrolipoamide succinyltransferase; Provisional	NA|101aa|down_1|NZ_CP013569.1_53870_54173_+	NA	NA|92aa|down_2|NZ_CP013569.1_54181_54457_+	NA	NA|98aa|down_3|NZ_CP013569.1_54467_54761_+	NA	NA|68aa|down_4|NZ_CP013569.1_54757_54961_+	NA	NA|230aa|down_5|NZ_CP013569.1_54975_55665_+	NA	NA|66aa|down_6|NZ_CP013569.1_55674_55872_+	NA	NA|290aa|down_7|NZ_CP013569.1_55873_56743_+	cd07404, MPP_MS158, Microscilla MS158 and related proteins, metallophosphatase domain	NA|133aa|down_8|NZ_CP013569.1_56769_57168_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|171aa|down_9|NZ_CP013569.1_57168_57681_+	NA
