assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000266905.1_ASM26690v1	NC_018027	Mycolicibacterium chubuense NBB4, complete sequence	1	1431927-1432038	1	CRISPRCasFinder	no	cas3	csa3,WYL,cas3,casR,cas4,DEDDh,DinG	Unclear	CCTCGCCGCTTGATCGTCACCCTCCGAG	28	0	0	NA	NA	NA	1	1	Unclear	csa3,WYL,cas3,casR,cas4,DEDDh,DinG	NA|137aa|up_4|NC_018027.1_1428480_1428891_+,NA	NA|72aa|up_9|NC_018027.1_1424615_1424831_+	PRK05889, PRK05889, biotin/lipoyl-binding carrier protein	NA|497aa|up_8|NC_018027.1_1424877_1426368_+	COG3920, COG3920, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|85aa|up_7|NC_018027.1_1426489_1426744_-	pfam02467, Whib, Transcription factor WhiB	NA|319aa|up_6|NC_018027.1_1427011_1427968_-	COG1597, LCB5, Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase [Lipid metabolism / General function prediction only]	NA|148aa|up_5|NC_018027.1_1428040_1428484_+	PRK05395, PRK05395, type II 3-dehydroquinate dehydratase	NA|137aa|up_4|NC_018027.1_1428480_1428891_+	NA	NA|143aa|up_3|NC_018027.1_1428842_1429271_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|366aa|up_2|NC_018027.1_1429330_1430428_-	COG1169, MenF, Isochorismate synthase [Coenzyme metabolism / Secondary metabolites biosynthesis, transport, and catabolism]	NA|204aa|up_1|NC_018027.1_1430424_1431036_-	PRK13462, PRK13462, acid phosphatase; Provisional	NA|268aa|up_0|NC_018027.1_1431117_1431921_+	pfam13614, AAA_31, AAA domain	cas3|494aa|down_0|NC_018027.1_1433247_1434729_-	COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis]	NA|235aa|down_1|NC_018027.1_1434997_1435702_+	pfam13794, MiaE_2, tRNA-(MS[2]IO[6]A)-hydroxylase (MiaE)-like	NA|285aa|down_2|NC_018027.1_1435705_1436560_-	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|84aa|down_3|NC_018027.1_1436813_1437065_+	pfam11305, DUF3107, Protein of unknown function (DUF3107)	NA|227aa|down_4|NC_018027.1_1437092_1437773_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|342aa|down_5|NC_018027.1_1437964_1438990_+	pfam11350, DUF3152, Protein of unknown function (DUF3152)	NA|404aa|down_6|NC_018027.1_1439249_1440461_+	pfam01548, DEDD_Tnp_IS110, Transposase	NA|398aa|down_7|NC_018027.1_1440612_1441806_+	PRK07878, PRK07878, molybdopterin biosynthesis-like protein MoeZ; Validated	NA|285aa|down_8|NC_018027.1_1441842_1442697_+	TIGR02569, conserved_hypothetical_protein, TIGR02569 family protein	NA|99aa|down_9|NC_018027.1_1442707_1443004_-	COG3695, COG3695, Predicted methylated DNA-protein cysteine methyltransferase [DNA replication, recombination, and repair]
GCF_000266905.1_ASM26690v1	NC_018027	Mycolicibacterium chubuense NBB4, complete sequence	2	1496067-1496143	2	CRISPRCasFinder	no		csa3,WYL,cas3,casR,cas4,DEDDh,DinG	Orphan	CTGGCTTCCTTGCGGGCCTGGCG	23	0	0	NA	NA	NA	1	1	Orphan	csa3,WYL,cas3,casR,cas4,DEDDh,DinG	NA|255aa|up_9|NC_018027.1_1487244_1488009_-,NA|245aa|up_8|NC_018027.1_1488005_1488740_-,NA|94aa|down_2|NC_018027.1_1500351_1500633_-,NA|117aa|down_3|NC_018027.1_1500890_1501241_-	NA|255aa|up_9|NC_018027.1_1487244_1488009_-	NA	NA|245aa|up_8|NC_018027.1_1488005_1488740_-	NA	NA|205aa|up_7|NC_018027.1_1488995_1489610_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|435aa|up_6|NC_018027.1_1489697_1491002_+	pfam00135, COesterase, Carboxylesterase family	NA|298aa|up_5|NC_018027.1_1491032_1491926_-	cd13431, LDT_IgD_like_1, IgD-like repeat domain of mycobacterial L,D-transpeptidases	NA|151aa|up_4|NC_018027.1_1491982_1492435_-	TIGR01068, Thioredoxin-like_protein_slr0233, thioredoxin	NA|145aa|up_3|NC_018027.1_1492590_1493025_+	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|560aa|up_2|NC_018027.1_1493025_1494705_+	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|134aa|up_1|NC_018027.1_1494701_1495103_-	pfam03928, Haem_degrading, Haem-degrading	NA|143aa|up_0|NC_018027.1_1495153_1495582_-	TIGR03618, Rv1155_F420, PPOX class probable F420-dependent enzyme	NA|278aa|down_0|NC_018027.1_1497897_1498731_-	pfam08924, DUF1906, Domain of unknown function (DUF1906)	NA|524aa|down_1|NC_018027.1_1498773_1500345_+	pfam00144, Beta-lactamase, Beta-lactamase	NA|94aa|down_2|NC_018027.1_1500351_1500633_-	NA	NA|117aa|down_3|NC_018027.1_1500890_1501241_-	NA	NA|930aa|down_4|NC_018027.1_1501579_1504369_+	pfam18564, Glyco_hydro_5_C, Glycoside hydrolase family 5 C-terminal domain	NA|414aa|down_5|NC_018027.1_1504392_1505634_-	COG3174, COG3174, Predicted membrane protein [Function unknown]	NA|359aa|down_6|NC_018027.1_1505646_1506723_-	pfam00296, Bac_luciferase, Luciferase-like monooxygenase	NA|867aa|down_7|NC_018027.1_1506812_1509413_-	cd02080, P-type_ATPase_cation, P-type cation-transporting ATPase similar to Exiguobacterium aurantiacum Mna, an Na(+)-ATPase, and Synechocystis sp	NA|466aa|down_8|NC_018027.1_1510030_1511428_+	COG1541, PaaK, Coenzyme F390 synthetase [Coenzyme metabolism]	NA|449aa|down_9|NC_018027.1_1511448_1512795_-	cd07724, POD-like_MBL-fold, ETHE1 (PDO type I), persulfide dioxygenase A (PDOA, PDO type II) and related proteins; MBL-fold metallo-hydrolase domain
GCF_000266905.1_ASM26690v1	NC_018027	Mycolicibacterium chubuense NBB4, complete sequence	3	2279211-2279309	3	CRISPRCasFinder	no		csa3,WYL,cas3,casR,cas4,DEDDh,DinG	Orphan	CAATGCGCCTGACGTCCCGGGTGTGCCTGACACCCC	36	0	0	NA	NA	NA	1	1	Orphan	csa3,WYL,cas3,casR,cas4,DEDDh,DinG	NA|61aa|up_9|NC_018027.1_2270267_2270450_-,NA|62aa|up_8|NC_018027.1_2270580_2270766_-,NA|61aa|up_7|NC_018027.1_2270855_2271038_-,NA|106aa|up_6|NC_018027.1_2271153_2271471_-,NA|142aa|up_5|NC_018027.1_2271910_2272336_-,NA|157aa|down_7|NC_018027.1_2287434_2287905_-,NA|78aa|down_9|NC_018027.1_2288882_2289116_-	NA|61aa|up_9|NC_018027.1_2270267_2270450_-	NA	NA|62aa|up_8|NC_018027.1_2270580_2270766_-	NA	NA|61aa|up_7|NC_018027.1_2270855_2271038_-	NA	NA|106aa|up_6|NC_018027.1_2271153_2271471_-	NA	NA|142aa|up_5|NC_018027.1_2271910_2272336_-	NA	NA|206aa|up_4|NC_018027.1_2272864_2273482_+	TIGR03083, TIGR03083, uncharacterized Actinobacterial protein TIGR03083	NA|90aa|up_3|NC_018027.1_2273838_2274108_-	COG3462, COG3462, Predicted membrane protein [Function unknown]	NA|501aa|up_2|NC_018027.1_2274157_2275660_-	COG2132, SufI, Putative multicopper oxidases [Secondary metabolites biosynthesis, transport, and catabolism]	NA|256aa|up_1|NC_018027.1_2275808_2276576_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|385aa|up_0|NC_018027.1_2276572_2277727_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|230aa|down_0|NC_018027.1_2279524_2280214_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|486aa|down_1|NC_018027.1_2280174_2281632_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|243aa|down_2|NC_018027.1_2281662_2282391_-	cd03392, PAP2_like_2, PAP2_like_2 proteins	NA|216aa|down_3|NC_018027.1_2282595_2283243_+	COG0586, DedA, Uncharacterized membrane-associated protein [Function unknown]	NA|216aa|down_4|NC_018027.1_2283362_2284010_+	pfam09203, MspA, MspA	NA|414aa|down_5|NC_018027.1_2284281_2285523_-	COG4705, COG4705, Uncharacterized membrane-anchored protein conserved in bacteria [Function unknown]	NA|388aa|down_6|NC_018027.1_2286109_2287273_-	pfam15308, CEP170_C, CEP170 C-terminus	NA|157aa|down_7|NC_018027.1_2287434_2287905_-	NA	NA|264aa|down_8|NC_018027.1_2288065_2288857_+	pfam14023, DUF4239, Protein of unknown function (DUF4239)	NA|78aa|down_9|NC_018027.1_2288882_2289116_-	NA
GCF_000266905.1_ASM26690v1	NC_018027	Mycolicibacterium chubuense NBB4, complete sequence	4	2336705-2337370	1	CRT	no		csa3,WYL,cas3,casR,cas4,DEDDh,DinG	Orphan	CGACGCCGCCNGCGCCGGGTGCNCCGGGTGNNCCGGTGG	39	3	5	2336744-2336761|2337200-2337217|2337200-2337217|2337314-2337331|2337314-2337331	NC_018027.1_2336630-2336647|NC_018027.1_5233197-5233180|NC_018027.1_5496239-5496256|NC_018027.1_5233197-5233180|NC_018027.1_5496239-5496256	NA	11	11	Orphan	csa3,WYL,cas3,casR,cas4,DEDDh,DinG	NA|151aa|up_1|NC_018027.1_2334773_2335226_-,NA	NA|311aa|up_9|NC_018027.1_2324974_2325907_-	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|437aa|up_8|NC_018027.1_2325903_2327214_-	cd14750, PBP2_TMBP, The periplasmic-binding component of ABC transport systems specific for trehalose/maltose; possesses type 2 periplasmic binding fold	NA|552aa|up_7|NC_018027.1_2327489_2329145_+	cd11332, AmyAc_OligoGlu_TS, Alpha amylase catalytic domain found in oligo-1,6-glucosidase (also called isomaltase; sucrase-isomaltase; alpha-limit dextrinase), trehalose synthase (also called maltose alpha-D-glucosyltransferase), and related proteins	NA|328aa|up_6|NC_018027.1_2329241_2330225_+	PRK07877, PRK07877, Rv1355c family protein	NA|217aa|up_5|NC_018027.1_2330241_2330892_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|275aa|up_4|NC_018027.1_2330925_2331750_-	pfam00582, Usp, Universal stress protein family	NA|332aa|up_3|NC_018027.1_2331743_2332739_-	PRK07877, PRK07877, Rv1355c family protein	NA|552aa|up_2|NC_018027.1_2332909_2334565_+	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|151aa|up_1|NC_018027.1_2334773_2335226_-	NA	NA|231aa|up_0|NC_018027.1_2335273_2335966_+	pfam02956, TT_ORF1, TT viral orf 1	NA|234aa|down_0|NC_018027.1_2338895_2339597_-	pfam03861, ANTAR, ANTAR domain	NA|166aa|down_1|NC_018027.1_2339736_2340234_-	pfam09348, DUF1990, Domain of unknown function (DUF1990)	NA|291aa|down_2|NC_018027.1_2340272_2341145_+	cd19088, AKR_AKR13B1, AKR13B family of aldo-keto reductase (AKR)	NA|210aa|down_3|NC_018027.1_2341166_2341796_+	cd03443, PaaI_thioesterase, PaaI_thioesterase is a tetrameric acyl-CoA thioesterase with a hot dog fold and one of several proteins responsible for phenylacetic acid (PA) degradation in bacteria	NA|146aa|down_4|NC_018027.1_2341792_2342230_+	TIGR02611, Putative_membrane_protein, TIGR02611 family protein	NA|692aa|down_5|NC_018027.1_2342319_2344395_+	PRK12305, thrS, threonyl-tRNA synthetase; Reviewed	NA|183aa|down_6|NC_018027.1_2344396_2344945_+	cd01275, FHIT, FHIT (fragile histidine family): FHIT proteins, related to the HIT family carry a motif HxHxH/Qxx (x, is a hydrophobic amino acid), On the basis of sequence, substrate specificity, structure, evolution and mechanism, HIT proteins are classified into three  branches: the Hint branch, which consists of adenosine 5' -monophosphoramide hydrolases, the Fhit branch, that consists of diadenosine polyphosphate hydrolases, and the GalT branch consisting of specific nucloside monophosphate transferases	NA|224aa|down_7|NC_018027.1_2344941_2345613_+	COG0558, PgsA, Phosphatidylglycerophosphate synthase [Lipid metabolism]	NA|319aa|down_8|NC_018027.1_2345609_2346566_+	PRK07920, PRK07920, lipid A biosynthesis lauroyl acyltransferase; Provisional	NA|377aa|down_9|NC_018027.1_2346565_2347696_+	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase
