assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000975175.1_ASM97517v1	NZ_CP011295	Rhodococcus erythropolis strain BG43 chromosome, complete genome	1	128287-128398	1	CRISPRCasFinder	no		cas3,WYL,csa3,cas4,DEDDh,DinG,Cas9_archaeal,RT	Orphan	TGAACGGGTTGTCGACGAGGGCGTTGGTGCC	31	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,csa3,cas4,DEDDh,DinG,Cas9_archaeal,RT,csf1gr8,csf4gr11,csf2gr7,csf3gr5,PD-DExK	NA,NA|151aa|down_6|NZ_CP011295.1_133715_134168_-,NA|416aa|down_8|NZ_CP011295.1_135886_137134_-	NA|401aa|up_9|NZ_CP011295.1_118064_119267_-	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|349aa|up_8|NZ_CP011295.1_119381_120428_+	COG0577, SalY, ABC-type antimicrobial peptide transport system, permease component [Defense mechanisms]	NA|247aa|up_7|NZ_CP011295.1_120424_121165_+	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|147aa|up_6|NZ_CP011295.1_121276_121717_+	pfam13426, PAS_9, PAS domain	NA|209aa|up_5|NZ_CP011295.1_121723_122350_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|324aa|up_4|NZ_CP011295.1_122401_123373_+	COG1748, LYS9, Saccharopine dehydrogenase and related proteins [Amino acid transport and metabolism]	NA|431aa|up_3|NZ_CP011295.1_123319_124612_-	TIGR03604, hypothetical_protein, thiazole/oxazole-forming peptide maturase, SagD family component	NA|447aa|up_2|NZ_CP011295.1_124608_125949_-	TIGR03604, hypothetical_protein, thiazole/oxazole-forming peptide maturase, SagD family component	NA|476aa|up_1|NZ_CP011295.1_125941_127369_-	COG2936, COG2936, Predicted acyl esterases [General function prediction only]	NA|271aa|up_0|NZ_CP011295.1_127365_128178_-	TIGR03882, hypothetical_protein, bacteriocin biosynthesis cyclodehydratase domain	NA|420aa|down_0|NZ_CP011295.1_128553_129813_+	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|356aa|down_1|NZ_CP011295.1_129897_130965_+	pfam01032, FecCD, FecCD transport family	NA|265aa|down_2|NZ_CP011295.1_130961_131756_+	COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism]	NA|327aa|down_3|NZ_CP011295.1_131752_132733_+	cd01148, TroA_a, Metal binding protein TroA_a	NA|229aa|down_4|NZ_CP011295.1_132737_133424_+	TIGR03605, antibiot_sagB, SagB-type dehydrogenase domain	NA|98aa|down_5|NZ_CP011295.1_133425_133719_-	pfam09851, SHOCT, Short C-terminal domain	NA|151aa|down_6|NZ_CP011295.1_133715_134168_-	NA	NA|549aa|down_7|NZ_CP011295.1_134171_135818_-	cd17321, MFS_MMR_MDR_like, Methylenomycin A resistance protein (also called MMR peptide) and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|416aa|down_8|NZ_CP011295.1_135886_137134_-	NA	NA|196aa|down_9|NZ_CP011295.1_137152_137740_+	pfam04978, DUF664, Protein of unknown function (DUF664)
GCF_000975175.1_ASM97517v1	NZ_CP011295	Rhodococcus erythropolis strain BG43 chromosome, complete genome	2	2251525-2251610	2	CRISPRCasFinder	no		cas3,WYL,csa3,cas4,DEDDh,DinG,Cas9_archaeal,RT	Orphan	CACCGGATGCTGCGCCGCCGCCG	23	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,csa3,cas4,DEDDh,DinG,Cas9_archaeal,RT,csf1gr8,csf4gr11,csf2gr7,csf3gr5,PD-DExK	NA|117aa|up_1|NZ_CP011295.1_2250463_2250814_-,NA	NA|904aa|up_9|NZ_CP011295.1_2242468_2245180_-	PRK09800, PRK09800, putative hypoxanthine oxidase; Provisional	NA|270aa|up_8|NZ_CP011295.1_2245176_2245986_-	pfam00941, FAD_binding_5, FAD binding domain in molybdopterin dehydrogenase	NA|457aa|up_7|NZ_CP011295.1_2245993_2247364_-	PRK08203, PRK08203, hydroxydechloroatrazine ethylaminohydrolase; Reviewed	NA|316aa|up_6|NZ_CP011295.1_2247360_2248308_-	TIGR03383, Structure_Of_Uricase, urate oxidase	NA|110aa|up_5|NZ_CP011295.1_2248342_2248672_-	pfam00576, Transthyretin, HIUase/Transthyretin family	NA|173aa|up_4|NZ_CP011295.1_2248668_2249187_-	PRK13798, PRK13798, putative OHCU decarboxylase; Provisional	NA|267aa|up_3|NZ_CP011295.1_2249303_2250104_+	pfam13350, Y_phosphatase3, Tyrosine phosphatase family	NA|121aa|up_2|NZ_CP011295.1_2250104_2250467_-	COG5652, COG5652, Predicted integral membrane protein [Function unknown]	NA|117aa|up_1|NZ_CP011295.1_2250463_2250814_-	NA	NA|142aa|up_0|NZ_CP011295.1_2250813_2251239_-	pfam08044, DUF1707, Domain of unknown function (DUF1707)	NA|467aa|down_0|NZ_CP011295.1_2253250_2254651_-	cd19539, SgcC5_NRPS-like, SgcC5 is a non-ribosomal peptide synthetase (NRPS) condensation enzyme with ester- and amide- bond forming activity and similar C-domains of modular NRPSs	NA|139aa|down_1|NZ_CP011295.1_2254655_2255072_-	pfam02657, SufE, Fe-S metabolism associated domain	NA|302aa|down_2|NZ_CP011295.1_2255068_2255974_-	COG2897, SseA, Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism]	NA|488aa|down_3|NZ_CP011295.1_2256092_2257556_-	pfam00668, Condensation, Condensation domain	NA|498aa|down_4|NZ_CP011295.1_2257555_2259049_-	pfam00668, Condensation, Condensation domain	NA|566aa|down_5|NZ_CP011295.1_2259199_2260897_+	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|435aa|down_6|NZ_CP011295.1_2260960_2262265_+	pfam02720, DUF222, Domain of unknown function (DUF222)	NA|214aa|down_7|NZ_CP011295.1_2262374_2263016_-	PRK00148, PRK00148, Maf-like protein; Reviewed	NA|114aa|down_8|NZ_CP011295.1_2263031_2263373_-	pfam13822, ACC_epsilon, Acyl-CoA carboxylase epsilon subunit	NA|547aa|down_9|NZ_CP011295.1_2263369_2265010_-	COG4799, COG4799, Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) [Lipid metabolism]
GCF_000975175.1_ASM97517v1	NZ_CP011295	Rhodococcus erythropolis strain BG43 chromosome, complete genome	3	3128483-3128979	1	CRT	no	csa3	cas3,WYL,csa3,cas4,DEDDh,DinG,Cas9_archaeal,RT	Type I-A	CCCTGGTANCCACCGGANGAGCGGTTGCCCTGGTA	35	0	0	NA	NA	NA	7	7	Orphan	cas3,WYL,csa3,cas4,DEDDh,DinG,Cas9_archaeal,RT,csf1gr8,csf4gr11,csf2gr7,csf3gr5,PD-DExK	NA|293aa|up_6|NZ_CP011295.1_3119143_3120022_-,NA|294aa|up_2|NZ_CP011295.1_3124240_3125122_-,NA|261aa|down_6|NZ_CP011295.1_3138363_3139146_+	NA|345aa|up_9|NZ_CP011295.1_3116276_3117311_-	cd02653, nuc_hydro_3, NH_3: A subgroup of nucleoside hydrolases	NA|159aa|up_8|NZ_CP011295.1_3117303_3117780_-	COG1764, osmC, Organic hydroperoxide reductase [Secondary metabolites biosynthesis, transport and catabolism]	NA|438aa|up_7|NZ_CP011295.1_3117811_3119125_-	pfam02515, CoA_transf_3, CoA-transferase family III	NA|293aa|up_6|NZ_CP011295.1_3119143_3120022_-	NA	NA|363aa|up_5|NZ_CP011295.1_3120025_3121114_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|461aa|up_4|NZ_CP011295.1_3121106_3122489_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|479aa|up_3|NZ_CP011295.1_3122800_3124237_-	PRK07807, PRK07807, GuaB1 family IMP dehydrogenase-related protein	NA|294aa|up_2|NZ_CP011295.1_3124240_3125122_-	NA	NA|501aa|up_1|NZ_CP011295.1_3125126_3126629_-	COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis]	NA|486aa|up_0|NZ_CP011295.1_3126691_3128149_-	PRK09287, PRK09287, NADP-dependent phosphogluconate dehydrogenase	NA|318aa|down_0|NZ_CP011295.1_3130596_3131550_-	cd07326, M56_BlaR1_MecR1_like, Peptidase M56-like including those in BlaR1 and MecR1, integral membrane metallopeptidase	NA|138aa|down_1|NZ_CP011295.1_3131575_3131989_-	COG3682, COG3682, Predicted transcriptional regulator [Transcription]	NA|858aa|down_2|NZ_CP011295.1_3132180_3134754_+	pfam09924, DUF2156, Uncharacterized conserved protein (DUF2156)	NA|284aa|down_3|NZ_CP011295.1_3134764_3135616_-	COG1120, FepC, ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism]	NA|364aa|down_4|NZ_CP011295.1_3135612_3136704_-	COG4779, FepG, ABC-type enterobactin transport system, permease component [Inorganic ion transport and metabolism]	NA|487aa|down_5|NZ_CP011295.1_3136700_3138161_-	PRK10441, PRK10441, Fe(3+)-siderophore ABC transporter permease	NA|261aa|down_6|NZ_CP011295.1_3138363_3139146_+	NA	NA|143aa|down_7|NZ_CP011295.1_3139382_3139811_+	cd03443, PaaI_thioesterase, PaaI_thioesterase is a tetrameric acyl-CoA thioesterase with a hot dog fold and one of several proteins responsible for phenylacetic acid (PA) degradation in bacteria	NA|472aa|down_8|NZ_CP011295.1_3139898_3141314_-	COG1252, Ndh, NADH dehydrogenase, FAD-containing subunit [Energy production and conversion]	csa3|255aa|down_9|NZ_CP011295.1_3141757_3142522_+	cd08893, SRPBCC_CalC_Aha1-like_GntR-HTH, Putative hydrophobic ligand-binding SRPBCC domain of an uncharacterized subgroup of CalC- and Aha1-like proteins; some contain an N-terminal GntR family winged HTH DNA-binding domain
GCF_000975175.1_ASM97517v1	NZ_CP011295	Rhodococcus erythropolis strain BG43 chromosome, complete genome	4	3219129-3219208	3	CRISPRCasFinder	no	WYL,cas4	cas3,WYL,csa3,cas4,DEDDh,DinG,Cas9_archaeal,RT	Unclear	CCCCACGGCTGTTGCTGGCCGGG	23	0	0	NA	NA	NA	1	1	Unclear	cas3,WYL,csa3,cas4,DEDDh,DinG,Cas9_archaeal,RT,csf1gr8,csf4gr11,csf2gr7,csf3gr5,PD-DExK	NA,NA	NA|120aa|up_9|NZ_CP011295.1_3210108_3210468_-	cd07238, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|214aa|up_8|NZ_CP011295.1_3210478_3211120_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|322aa|up_7|NZ_CP011295.1_3211279_3212245_-	cd06225, HAMP, Histidine kinase, Adenylyl cyclase, Methyl-accepting protein, and Phosphatase (HAMP) domain	NA|249aa|up_6|NZ_CP011295.1_3212345_3213092_-	PRK08057, PRK08057, cobalt-precorrin-6x reductase; Reviewed	NA|250aa|up_5|NZ_CP011295.1_3213115_3213865_-	COG2875, CobM, Precorrin-4 methylase [Coenzyme metabolism]	NA|428aa|up_4|NZ_CP011295.1_3213861_3215145_-	COG2242, CobL, Precorrin-6B methylase 2 [Coenzyme metabolism]	NA|254aa|up_3|NZ_CP011295.1_3215141_3215903_-	PRK05599, PRK05599, SDR family oxidoreductase	NA|142aa|up_2|NZ_CP011295.1_3215933_3216359_+	TIGR03618, Rv1155_F420, PPOX class probable F420-dependent enzyme	NA|379aa|up_1|NZ_CP011295.1_3216359_3217496_-	COG0006, PepP, Xaa-Pro aminopeptidase [Amino acid transport and metabolism]	NA|322aa|up_0|NZ_CP011295.1_3217533_3218499_+	smart00475, 53EXOc, 5'-3' exonuclease	NA|904aa|down_0|NZ_CP011295.1_3219389_3222101_-	COG4581, COG4581, Superfamily II RNA helicase [DNA replication, recombination, and repair]	NA|343aa|down_1|NZ_CP011295.1_3222149_3223178_-	pfam00902, TatC, Sec-independent protein translocase protein (TatC)	NA|92aa|down_2|NZ_CP011295.1_3223251_3223527_-	PRK00575, tatA, Sec-independent protein translocase subunit TatA	WYL|326aa|down_3|NZ_CP011295.1_3223661_3224639_-	COG2378, COG2378, Predicted transcriptional regulator [Transcription]	WYL|338aa|down_4|NZ_CP011295.1_3224638_3225652_-	COG2378, COG2378, Predicted transcriptional regulator [Transcription]	NA|448aa|down_5|NZ_CP011295.1_3225762_3227106_-	TIGR03686, pupylate_PafA, Pup--protein ligase	NA|488aa|down_6|NZ_CP011295.1_3227220_3228684_+	cd03392, PAP2_like_2, PAP2_like_2 proteins	NA|256aa|down_7|NZ_CP011295.1_3228721_3229489_-	TIGR03691, 20S_bact_alpha, proteasome, alpha subunit, bacterial type	NA|293aa|down_8|NZ_CP011295.1_3229485_3230364_-	TIGR03690, 20S_bact_beta, proteasome, beta subunit, bacterial type	NA|65aa|down_9|NZ_CP011295.1_3230360_3230555_-	pfam05639, Pup, Pup-like protein
GCF_000975175.1_ASM97517v1	NZ_CP011295	Rhodococcus erythropolis strain BG43 chromosome, complete genome	5	4002654-4002789	4	CRISPRCasFinder	no	DinG	cas3,WYL,csa3,cas4,DEDDh,DinG,Cas9_archaeal,RT	Type IV-A	CGTTGGGTCGATCTGAGGCGACGAACGTACCGTTCACTCG	40	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,csa3,cas4,DEDDh,DinG,Cas9_archaeal,RT,csf1gr8,csf4gr11,csf2gr7,csf3gr5,PD-DExK	NA|115aa|up_3|NZ_CP011295.1_3999149_3999494_-,NA	NA|483aa|up_9|NZ_CP011295.1_3993351_3994800_+	COG2321, COG2321, Predicted metalloprotease [General function prediction only]	NA|299aa|up_8|NZ_CP011295.1_3994818_3995715_-	COG3118, COG3118, Thioredoxin domain-containing protein [Posttranslational modification, protein turnover, chaperones]	NA|121aa|up_7|NZ_CP011295.1_3995820_3996183_-	pfam12823, DUF3817, Domain of unknown function (DUF3817)	NA|398aa|up_6|NZ_CP011295.1_3996283_3997477_-	PRK05790, PRK05790, putative acyltransferase; Provisional	NA|158aa|up_5|NZ_CP011295.1_3997584_3998058_+	cd07249, MMCE, Methylmalonyl-CoA epimerase (MMCE)	NA|348aa|up_4|NZ_CP011295.1_3998098_3999142_+	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|115aa|up_3|NZ_CP011295.1_3999149_3999494_-	NA	NA|225aa|up_2|NZ_CP011295.1_3999513_4000188_-	PRK03298, PRK03298, endonuclease NucS	NA|531aa|up_1|NZ_CP011295.1_4000230_4001823_+	cd07302, CHD, cyclase homology domain	NA|209aa|up_0|NZ_CP011295.1_4001965_4002592_+	COG0262, FolA, Dihydrofolate reductase [Coenzyme metabolism]	NA|420aa|down_0|NZ_CP011295.1_4008763_4010023_-	PRK09369, PRK09369, UDP-N-acetylglucosamine 1-carboxyvinyltransferase; Validated	NA|191aa|down_1|NZ_CP011295.1_4010077_4010650_+	pfam01923, Cob_adeno_trans, Cobalamin adenosyltransferase	NA|141aa|down_2|NZ_CP011295.1_4010660_4011083_-	pfam10739, DUF2550, Protein of unknown function (DUF2550)	NA|123aa|down_3|NZ_CP011295.1_4011132_4011501_-	PRK00571, atpC, F0F1 ATP synthase subunit epsilon; Validated	NA|484aa|down_4|NZ_CP011295.1_4011507_4012959_-	PRK09280, PRK09280, F0F1 ATP synthase subunit beta; Validated	NA|328aa|down_5|NZ_CP011295.1_4012962_4013946_-	PRK05621, PRK05621, F0F1 ATP synthase subunit gamma; Validated	NA|548aa|down_6|NZ_CP011295.1_4013994_4015638_-	PRK09281, PRK09281, F0F1 ATP synthase subunit alpha; Validated	NA|275aa|down_7|NZ_CP011295.1_4015700_4016525_-	PRK13430, PRK13430, F0F1 ATP synthase subunit delta; Provisional	NA|187aa|down_8|NZ_CP011295.1_4016530_4017091_-	PRK05759, PRK05759, F0F1 ATP synthase subunit B; Validated	NA|81aa|down_9|NZ_CP011295.1_4017098_4017341_-	PRK07874, PRK07874, ATP synthase F0 subunit C
