assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000520875.1_ASM52087v1	NZ_CP007067	Rhizobium leguminosarum bv. trifolii CB782 chromosome, complete genome	1	781902-782001	1	CRISPRCasFinder	no		cas3,csa3,WYL,DEDDh	Orphan	TCTCCCCGCCTGCGGGGAGAAGGG	24	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,WYL,DEDDh	NA|129aa|up_2|NZ_CP007067.1_779205_779592_+,NA|158aa|up_1|NZ_CP007067.1_779609_780083_+,NA	NA|307aa|up_9|NZ_CP007067.1_771114_772035_+	cd08422, PBP2_CrgA_like, The C-terminal substrate binding domain of LysR-type transcriptional regulator CrgA and its related homologs, contains the type 2 periplasmic binding domain	NA|318aa|up_8|NZ_CP007067.1_772091_773045_-	cd05251, NmrA_like_SDR_a, NmrA (a transcriptional regulator) and HSCARG (an NADPH sensor) like proteins, atypical (a) SDRs	NA|499aa|up_7|NZ_CP007067.1_773534_775031_-	PRK05124, cysN, sulfate adenylyltransferase subunit 1; Provisional	NA|318aa|up_6|NZ_CP007067.1_775032_775986_-	PRK05253, PRK05253, sulfate adenylyltransferase subunit CysD	NA|253aa|up_5|NZ_CP007067.1_776100_776859_-	TIGR02055, Phosphoadenosine_phosphosulfate_reductase, thioredoxin-dependent adenylylsulfate APS reductase	NA|149aa|up_4|NZ_CP007067.1_777278_777725_+	COG1959, COG1959, Predicted transcriptional regulator [Transcription]	NA|448aa|up_3|NZ_CP007067.1_777725_779069_-	cd01034, EriC_like, ClC chloride channel family	NA|129aa|up_2|NZ_CP007067.1_779205_779592_+	NA	NA|158aa|up_1|NZ_CP007067.1_779609_780083_+	NA	NA|551aa|up_0|NZ_CP007067.1_780167_781820_-	PRK02106, PRK02106, choline dehydrogenase; Validated	NA|488aa|down_0|NZ_CP007067.1_782035_783499_-	PRK13252, PRK13252, betaine aldehyde dehydrogenase; Provisional	NA|194aa|down_1|NZ_CP007067.1_783500_784082_-	PRK00767, PRK00767, transcriptional regulator BetI; Validated	NA|200aa|down_2|NZ_CP007067.1_784238_784838_+	COG3247, HdeD, Uncharacterized conserved protein [Function unknown]	NA|81aa|down_3|NZ_CP007067.1_784944_785187_+	COG3609, COG3609, Predicted transcriptional regulators containing the CopG/Arc/MetJ DNA-binding domain [Transcription]	NA|333aa|down_4|NZ_CP007067.1_786583_787582_+	cd05276, p53_inducible_oxidoreductase, PIG3 p53-inducible quinone oxidoreductase	NA|476aa|down_5|NZ_CP007067.1_787562_788990_-	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|208aa|down_6|NZ_CP007067.1_789093_789717_+	COG2345, COG2345, Predicted transcriptional regulator [Transcription]	NA|404aa|down_7|NZ_CP007067.1_789741_790953_-	cd17471, MFS_Set, Sugar efflux transporter (Set) family of the Major Facilitator Superfamily of transporters	NA|336aa|down_8|NZ_CP007067.1_791384_792392_+	COG4448, AnsA, L-asparaginase II [Amino acid transport and metabolism]	NA|252aa|down_9|NZ_CP007067.1_792466_793222_-	COG1349, GlpR, Transcriptional regulators of sugar metabolism [Transcription / Carbohydrate transport and metabolism]
GCF_000520875.1_ASM52087v1	NZ_CP007067	Rhizobium leguminosarum bv. trifolii CB782 chromosome, complete genome	2	1509443-1509528	2	CRISPRCasFinder	no		cas3,csa3,WYL,DEDDh	Orphan	GCCGCCCCGAAGAAGAAGGCTGC	23	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,WYL,DEDDh	NA|156aa|up_5|NZ_CP007067.1_1502527_1502995_+,NA|58aa|down_7|NZ_CP007067.1_1517336_1517510_+,NA|111aa|down_8|NZ_CP007067.1_1517506_1517839_+	NA|116aa|up_9|NZ_CP007067.1_1498269_1498617_+	COG1862, YajC, Preprotein translocase subunit YajC [Intracellular trafficking and secretion]	NA|847aa|up_8|NZ_CP007067.1_1498662_1501203_+	PRK14726, PRK14726, protein translocase subunit SecDF	NA|129aa|up_7|NZ_CP007067.1_1501207_1501594_+	COG3737, COG3737, Uncharacterized conserved protein [Function unknown]	NA|286aa|up_6|NZ_CP007067.1_1501597_1502455_+	COG1562, ERG9, Phytoene/squalene synthetase [Lipid metabolism]	NA|156aa|up_5|NZ_CP007067.1_1502527_1502995_+	NA	NA|478aa|up_4|NZ_CP007067.1_1503003_1504437_-	PRK05335, PRK05335, tRNA (uracil-5-)-methyltransferase Gid; Reviewed	NA|48aa|up_3|NZ_CP007067.1_1504537_1504681_-	COG5457, COG5457, Uncharacterized conserved small protein [Function unknown]	NA|49aa|up_2|NZ_CP007067.1_1504982_1505129_-	COG5457, COG5457, Uncharacterized conserved small protein [Function unknown]	NA|368aa|up_1|NZ_CP007067.1_1505975_1507079_+	COG4872, COG4872, Predicted membrane protein [Function unknown]	NA|197aa|up_0|NZ_CP007067.1_1507075_1507666_+	pfam14345, GDYXXLXY, GDYXXLXY protein	NA|141aa|down_0|NZ_CP007067.1_1509795_1510218_+	COG3631, COG3631, Ketosteroid isomerase-related protein [General function prediction only]	NA|590aa|down_1|NZ_CP007067.1_1510297_1512067_-	smart00729, Elp3, Elongator protein 3, MiaB family, Radical SAM	NA|268aa|down_2|NZ_CP007067.1_1512228_1513032_-	cd09083, EEP-1, Exonuclease-Endonuclease-Phosphatase domain; uncharacterized family 1	NA|469aa|down_3|NZ_CP007067.1_1513096_1514503_-	PRK05249, PRK05249, Si-specific NAD(P)(+) transhydrogenase	NA|276aa|down_4|NZ_CP007067.1_1514622_1515450_-	PRK00024, PRK00024, DNA repair protein RadC	NA|279aa|down_5|NZ_CP007067.1_1515457_1516294_-	PRK05716, PRK05716, methionine aminopeptidase; Validated	NA|220aa|down_6|NZ_CP007067.1_1516544_1517204_+	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|58aa|down_7|NZ_CP007067.1_1517336_1517510_+	NA	NA|111aa|down_8|NZ_CP007067.1_1517506_1517839_+	NA	NA|165aa|down_9|NZ_CP007067.1_1517775_1518270_-	PRK00109, PRK00109, Holliday junction resolvase RuvX
GCF_000520875.1_ASM52087v1	NZ_CP007067	Rhizobium leguminosarum bv. trifolii CB782 chromosome, complete genome	3	1630790-1630890	3	CRISPRCasFinder	no		cas3,csa3,WYL,DEDDh	Orphan	GACCCAAAGCATGTCGCGCAAAAGTGTGCAGCGGTTT	37	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,WYL,DEDDh	NA,NA|76aa|down_1|NZ_CP007067.1_1631393_1631621_-,NA|57aa|down_2|NZ_CP007067.1_1631779_1631950_+,NA|61aa|down_4|NZ_CP007067.1_1635042_1635225_-,NA|57aa|down_8|NZ_CP007067.1_1637534_1637705_+,NA|79aa|down_9|NZ_CP007067.1_1637704_1637941_+	NA|196aa|up_9|NZ_CP007067.1_1620478_1621066_-	COG0590, CumB, Cytosine/adenosine deaminases [Nucleotide transport and metabolism / Translation, ribosomal structure and biogenesis]	NA|479aa|up_8|NZ_CP007067.1_1621333_1622770_+	cd11642, SUMT, Uroporphyrin-III C-methyltransferase (also known as S-Adenosyl-L-methionine:uroporphyrinogen III methyltransferase, SUMT)	NA|105aa|up_7|NZ_CP007067.1_1622772_1623087_+	pfam11011, DUF2849, Protein of unknown function (DUF2849)	NA|557aa|up_6|NZ_CP007067.1_1623095_1624766_+	COG0155, CysI, Sulfite reductase, beta subunit (hemoprotein) [Inorganic ion transport and metabolism]	NA|171aa|up_5|NZ_CP007067.1_1624776_1625289_+	COG3749, COG3749, Uncharacterized protein conserved in bacteria [Function unknown]	NA|404aa|up_4|NZ_CP007067.1_1625327_1626539_+	PRK07494, PRK07494, UbiH/UbiF family hydroxylase	NA|321aa|up_3|NZ_CP007067.1_1626506_1627469_-	COG0679, COG0679, Predicted permeases [General function prediction only]	NA|312aa|up_2|NZ_CP007067.1_1627643_1628579_-	cd07720, OPHC2-like_MBL-fold, Pseudomonas pseudoalcaligenes organophosphorus hydrolase C2, and related proteins; MBL-fold metallo hydrolase domain	NA|250aa|up_1|NZ_CP007067.1_1628760_1629510_-	COG0785, CcdA, Cytochrome c biogenesis protein [Posttranslational modification, protein turnover, chaperones]	NA|209aa|up_0|NZ_CP007067.1_1630099_1630726_+	COG3637, COG3637, Opacity protein and related surface antigens [Cell envelope biogenesis, outer membrane]	NA|68aa|down_0|NZ_CP007067.1_1631077_1631281_+	COG3237, COG3237, Uncharacterized protein conserved in bacteria [Function unknown]	NA|76aa|down_1|NZ_CP007067.1_1631393_1631621_-	NA	NA|57aa|down_2|NZ_CP007067.1_1631779_1631950_+	NA	NA|505aa|down_3|NZ_CP007067.1_1632275_1633790_+	PRK06834, PRK06834, hypothetical protein; Provisional	NA|61aa|down_4|NZ_CP007067.1_1635042_1635225_-	NA	NA|98aa|down_5|NZ_CP007067.1_1635531_1635825_+	pfam06169, DUF982, Protein of unknown function (DUF982)	NA|276aa|down_6|NZ_CP007067.1_1635962_1636790_-	cd07302, CHD, cyclase homology domain	NA|86aa|down_7|NZ_CP007067.1_1637175_1637433_+	pfam06169, DUF982, Protein of unknown function (DUF982)	NA|57aa|down_8|NZ_CP007067.1_1637534_1637705_+	NA	NA|79aa|down_9|NZ_CP007067.1_1637704_1637941_+	NA
