assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001664365.1_ASM166436v1	NZ_CP013574	Rhizobium phaseoli strain N671 chromosome, complete genome	1	1216598-1216707	1	CRISPRCasFinder	no	WYL	DEDDh,WYL,cas3,csa3	Unclear	GCGCAGCGGGAGCAATCGATCCAGTGAAT	29	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,cas3,csa3,PD-DExK,RT	NA|105aa|up_8|NZ_CP013574.1_1207380_1207695_+,NA|185aa|down_0|NZ_CP013574.1_1216809_1217364_-	NA|152aa|up_9|NZ_CP013574.1_1206793_1207249_+	cd04688, Nudix_Hydrolase_29, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|105aa|up_8|NZ_CP013574.1_1207380_1207695_+	NA	NA|506aa|up_7|NZ_CP013574.1_1207707_1209225_-	PRK12853, PRK12853, glucose-6-phosphate dehydrogenase	NA|257aa|up_6|NZ_CP013574.1_1209576_1210347_-	PRK01683, PRK01683, trans-aconitate 2-methyltransferase; Provisional	NA|425aa|up_5|NZ_CP013574.1_1210434_1211709_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|155aa|up_4|NZ_CP013574.1_1212109_1212574_-	pfam07883, Cupin_2, Cupin domain	NA|209aa|up_3|NZ_CP013574.1_1212645_1213272_-	cd01014, nicotinamidase_related, Nicotinamidase_ related amidohydrolases	NA|295aa|up_2|NZ_CP013574.1_1213349_1214234_+	COG0583, LysR, Transcriptional regulator [Transcription]	NA|347aa|up_1|NZ_CP013574.1_1214407_1215448_-	cd08260, Zn_ADH6, Alcohol dehydrogenases of the MDR family	NA|296aa|up_0|NZ_CP013574.1_1215698_1216586_+	PRK13356, PRK13356, branched-chain amino acid aminotransferase	NA|185aa|down_0|NZ_CP013574.1_1216809_1217364_-	NA	NA|300aa|down_1|NZ_CP013574.1_1217507_1218407_-	cd08422, PBP2_CrgA_like, The C-terminal substrate binding domain of LysR-type transcriptional regulator CrgA and its related homologs, contains the type 2 periplasmic binding domain	NA|414aa|down_2|NZ_CP013574.1_1218544_1219786_+	COG1566, EmrA, Multidrug resistance efflux pump [Defense mechanisms]	NA|540aa|down_3|NZ_CP013574.1_1219857_1221477_+	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|397aa|down_4|NZ_CP013574.1_1221503_1222694_+	cd07302, CHD, cyclase homology domain	NA|278aa|down_5|NZ_CP013574.1_1222857_1223691_+	PRK00768, nadE, ammonia-dependent NAD(+) synthetase	NA|228aa|down_6|NZ_CP013574.1_1223700_1224384_-	PRK10542, PRK10542, glutathionine S-transferase; Provisional	NA|206aa|down_7|NZ_CP013574.1_1224499_1225117_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|425aa|down_8|NZ_CP013574.1_1225078_1226353_-	cd17391, MFS_MdtG_MDR_like, Multidrug resistance protein MdtG and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|114aa|down_9|NZ_CP013574.1_1226466_1226808_+	pfam12680, SnoaL_2, SnoaL-like domain
GCF_001664365.1_ASM166436v1	NZ_CP013574	Rhizobium phaseoli strain N671 chromosome, complete genome	2	1467145-1467238	2	CRISPRCasFinder	no		DEDDh,WYL,cas3,csa3	Orphan	CAAGCTTGCTTGTCCGCCCGCGGCAAG	27	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,cas3,csa3,PD-DExK,RT	NA,NA|168aa|down_4|NZ_CP013574.1_1470375_1470879_-	NA|681aa|up_9|NZ_CP013574.1_1455394_1457437_-	COG1505, COG1505, Serine proteases of the peptidase family S9A [Amino acid transport and metabolism]	NA|435aa|up_8|NZ_CP013574.1_1457538_1458843_-	TIGR03860, FMN_nitrolo, FMN-dependent oxidoreductase, nitrilotriacetate monooxygenase family	NA|453aa|up_7|NZ_CP013574.1_1458857_1460216_-	pfam03972, MmgE_PrpD, MmgE/PrpD family	NA|262aa|up_6|NZ_CP013574.1_1460225_1461011_-	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|264aa|up_5|NZ_CP013574.1_1461015_1461807_-	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|249aa|up_4|NZ_CP013574.1_1461803_1462550_-	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|277aa|up_3|NZ_CP013574.1_1462574_1463405_-	cd13693, PBP2_polar_AA, Substrate binding domain of polar amino-acid uptake  ABC transporter; the type 2 periplasmic binding protein fold	NA|195aa|up_2|NZ_CP013574.1_1463512_1464097_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|282aa|up_1|NZ_CP013574.1_1464233_1465079_-	cd10917, CE4_NodB_like_6s_7s, Catalytic NodB homology domain of rhizobial NodB-like proteins	NA|628aa|up_0|NZ_CP013574.1_1465207_1467091_-	COG0488, Uup, ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only]	NA|350aa|down_0|NZ_CP013574.1_1467262_1468312_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|170aa|down_1|NZ_CP013574.1_1468451_1468961_-	COG2318, DinB, Uncharacterized protein conserved in bacteria [Function unknown]	NA|230aa|down_2|NZ_CP013574.1_1469098_1469788_+	cd03202, GST_C_etherase_LigE, C-terminal, alpha helical domain of Beta etherase LigE	NA|141aa|down_3|NZ_CP013574.1_1469869_1470292_+	PRK00668, ndk, mulitfunctional nucleoside diphosphate kinase/apyrimidinic endonuclease/3'-; Validated	NA|168aa|down_4|NZ_CP013574.1_1470375_1470879_-	NA	NA|182aa|down_5|NZ_CP013574.1_1470875_1471421_-	COG5516, COG5516, Conserved protein containing a Zn-ribbon-like motif, possibly RNA-binding [General function prediction only]	NA|299aa|down_6|NZ_CP013574.1_1471486_1472383_+	COG0559, LivH, Branched-chain amino acid ABC-type transport system, permease components [Amino acid transport and metabolism]	NA|154aa|down_7|NZ_CP013574.1_1472433_1472895_-	COG0314, MoaE, Molybdopterin converting factor, large subunit [Coenzyme metabolism]	NA|85aa|down_8|NZ_CP013574.1_1473139_1473394_-	TIGR01682, Molybdopterin_synthase_sulfur_carrier_subunit, molybdopterin converting factor, subunit 1, non-archaeal	NA|197aa|down_9|NZ_CP013574.1_1473390_1473981_-	TIGR00560, pgsA, CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyltransferase
GCF_001664365.1_ASM166436v1	NZ_CP013574	Rhizobium phaseoli strain N671 chromosome, complete genome	3	1926578-1926666	3	CRISPRCasFinder	no		DEDDh,WYL,cas3,csa3	Orphan	GCCGCTCCGAAGAAGAAGGCTGC	23	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,cas3,csa3,PD-DExK,RT	NA|156aa|up_5|NZ_CP013574.1_1919591_1920059_+,NA|58aa|down_9|NZ_CP013574.1_1935813_1935987_+	NA|116aa|up_9|NZ_CP013574.1_1915337_1915685_+	COG1862, YajC, Preprotein translocase subunit YajC [Intracellular trafficking and secretion]	NA|847aa|up_8|NZ_CP013574.1_1915729_1918270_+	PRK14726, PRK14726, protein translocase subunit SecDF	NA|129aa|up_7|NZ_CP013574.1_1918274_1918661_+	COG3737, COG3737, Uncharacterized conserved protein [Function unknown]	NA|286aa|up_6|NZ_CP013574.1_1918663_1919521_+	COG1562, ERG9, Phytoene/squalene synthetase [Lipid metabolism]	NA|156aa|up_5|NZ_CP013574.1_1919591_1920059_+	NA	NA|477aa|up_4|NZ_CP013574.1_1920068_1921499_-	PRK05335, PRK05335, tRNA (uracil-5-)-methyltransferase Gid; Reviewed	NA|48aa|up_3|NZ_CP013574.1_1921597_1921741_-	COG5457, COG5457, Uncharacterized conserved small protein [Function unknown]	NA|49aa|up_2|NZ_CP013574.1_1922040_1922187_-	COG5457, COG5457, Uncharacterized conserved small protein [Function unknown]	NA|368aa|up_1|NZ_CP013574.1_1923084_1924188_+	COG4872, COG4872, Predicted membrane protein [Function unknown]	NA|197aa|up_0|NZ_CP013574.1_1924184_1924775_+	pfam14345, GDYXXLXY, GDYXXLXY protein	NA|141aa|down_0|NZ_CP013574.1_1926857_1927280_+	COG3631, COG3631, Ketosteroid isomerase-related protein [General function prediction only]	NA|592aa|down_1|NZ_CP013574.1_1927374_1929150_-	smart00729, Elp3, Elongator protein 3, MiaB family, Radical SAM	NA|268aa|down_2|NZ_CP013574.1_1929309_1930113_-	cd08372, EEP, Exonuclease-Endonuclease-Phosphatase (EEP) domain superfamily	NA|469aa|down_3|NZ_CP013574.1_1930172_1931579_-	PRK05249, PRK05249, Si-specific NAD(P)(+) transhydrogenase	NA|42aa|down_4|NZ_CP013574.1_1931677_1931803_-	PRK00024, PRK00024, DNA repair protein RadC	NA|396aa|down_5|NZ_CP013574.1_1931828_1933016_-	pfam01070, FMN_dh, FMN-dependent dehydrogenase	NA|275aa|down_6|NZ_CP013574.1_1933118_1933943_-	PRK00024, PRK00024, DNA repair protein RadC	NA|279aa|down_7|NZ_CP013574.1_1933950_1934787_-	PRK05716, PRK05716, methionine aminopeptidase; Validated	NA|220aa|down_8|NZ_CP013574.1_1935023_1935683_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|58aa|down_9|NZ_CP013574.1_1935813_1935987_+	NA
GCF_001664365.1_ASM166436v1	NZ_CP013575	Rhizobium phaseoli strain N671 plasmid pRphaN671a, complete sequence	1	51325-51427	1	CRISPRCasFinder	no			Orphan	CTTCCGCAAGGGTCCCGACTTCTCG	25	0	0	NA	NA	NA	1	1	Orphan	DEDDh,WYL,cas3,csa3,PD-DExK,RT	NA|237aa|up_7|NZ_CP013575.1_41812_42523_-,NA|97aa|up_6|NZ_CP013575.1_42770_43061_-,NA|223aa|up_5|NZ_CP013575.1_43264_43933_-,NA|264aa|up_4|NZ_CP013575.1_44385_45177_+,NA|57aa|up_2|NZ_CP013575.1_47948_48119_+,NA|238aa|up_1|NZ_CP013575.1_48217_48931_+,NA|423aa|up_0|NZ_CP013575.1_49071_50340_+,NA|101aa|down_1|NZ_CP013575.1_53870_54173_+,NA|92aa|down_2|NZ_CP013575.1_54181_54457_+,NA|98aa|down_3|NZ_CP013575.1_54467_54761_+,NA|68aa|down_4|NZ_CP013575.1_54757_54961_+,NA|230aa|down_5|NZ_CP013575.1_54975_55665_+,NA|66aa|down_6|NZ_CP013575.1_55674_55872_+,NA|171aa|down_9|NZ_CP013575.1_57168_57681_+	NA|388aa|up_9|NZ_CP013575.1_40032_41196_+	pfam14082, DUF4263, Domain of unknown function (DUF4263)	NA|192aa|up_8|NZ_CP013575.1_41208_41784_-	cd10436, GIY-YIG_EndoII_Hpy188I_like, Catalytic GIY-YIG domain of coliphage T4 non-specific endonuclease II, type II restriction endonuclease R	NA|237aa|up_7|NZ_CP013575.1_41812_42523_-	NA	NA|97aa|up_6|NZ_CP013575.1_42770_43061_-	NA	NA|223aa|up_5|NZ_CP013575.1_43264_43933_-	NA	NA|264aa|up_4|NZ_CP013575.1_44385_45177_+	NA	NA|833aa|up_3|NZ_CP013575.1_45182_47681_+	pfam10412, TrwB_AAD_bind, Type IV secretion-system coupling protein DNA-binding domain	NA|57aa|up_2|NZ_CP013575.1_47948_48119_+	NA	NA|238aa|up_1|NZ_CP013575.1_48217_48931_+	NA	NA|423aa|up_0|NZ_CP013575.1_49071_50340_+	NA	NA|480aa|down_0|NZ_CP013575.1_52081_53521_+	PTZ00144, PTZ00144, dihydrolipoamide succinyltransferase; Provisional	NA|101aa|down_1|NZ_CP013575.1_53870_54173_+	NA	NA|92aa|down_2|NZ_CP013575.1_54181_54457_+	NA	NA|98aa|down_3|NZ_CP013575.1_54467_54761_+	NA	NA|68aa|down_4|NZ_CP013575.1_54757_54961_+	NA	NA|230aa|down_5|NZ_CP013575.1_54975_55665_+	NA	NA|66aa|down_6|NZ_CP013575.1_55674_55872_+	NA	NA|290aa|down_7|NZ_CP013575.1_55873_56743_+	cd07404, MPP_MS158, Microscilla MS158 and related proteins, metallophosphatase domain	NA|133aa|down_8|NZ_CP013575.1_56769_57168_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|171aa|down_9|NZ_CP013575.1_57168_57681_+	NA
