assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_010725885.1_ASM1072588v1	NZ_AP022600	Mycolicibacterium tokaiense strain JCM 6373	1	348526-348606	1	CRISPRCasFinder	no		DinG,cas3,casR,DEDDh,csa3,WYL	Orphan	CTCAGCCCTTCGGACGCTGCCCG	23	0	0	NA	NA	NA	1	1	Orphan	DinG,cas3,casR,DEDDh,csa3,WYL	NA|161aa|up_7|NZ_AP022600.1_342297_342780_+,NA|340aa|down_1|NZ_AP022600.1_351026_352046_+	NA|300aa|up_9|NZ_AP022600.1_339745_340645_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|546aa|up_8|NZ_AP022600.1_340641_342279_+	COG3559, TnrB3, Putative exporter of polyketide antibiotics [Cell envelope biogenesis, outer membrane]	NA|161aa|up_7|NZ_AP022600.1_342297_342780_+	NA	NA|405aa|up_6|NZ_AP022600.1_342781_343996_-	PRK00844, glgC, glucose-1-phosphate adenylyltransferase; Provisional	NA|385aa|up_5|NZ_AP022600.1_344162_345317_+	TIGR02149, glgA_Coryne, glycogen synthase, Corynebacterium family	NA|56aa|up_4|NZ_AP022600.1_345366_345534_-	pfam11314, DUF3117, Protein of unknown function (DUF3117)	NA|189aa|up_3|NZ_AP022600.1_345657_346224_-	pfam03352, Adenine_glyco, Methyladenine glycosylase	NA|106aa|up_2|NZ_AP022600.1_346216_346534_-	TIGR03544, cell_division_initiation_protein_DivIVA, DivIVA domain	NA|318aa|up_1|NZ_AP022600.1_346617_347571_-	PRK13915, PRK13915, putative glucosyl-3-phosphoglycerate synthase; Provisional	NA|292aa|up_0|NZ_AP022600.1_347567_348443_-	TIGR01496, Dihydropteroate_synthase, dihydropteroate synthase	NA|192aa|down_0|NZ_AP022600.1_350439_351015_-	TIGR00730, LOG_family_protein_YJL055W, TIGR00730 family protein	NA|340aa|down_1|NZ_AP022600.1_351026_352046_+	NA	NA|280aa|down_2|NZ_AP022600.1_352017_352857_+	pfam17765, MLTR_LBD, MmyB-like transcription regulator ligand binding domain	NA|309aa|down_3|NZ_AP022600.1_352807_353734_-	TIGR03560, F420_Rv1855c, probable F420-dependent oxidoreductase, Rv1855c family	NA|202aa|down_4|NZ_AP022600.1_353863_354469_+	COG2080, CoxS, Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs [Energy production and conversion]	NA|326aa|down_5|NZ_AP022600.1_354470_355448_+	COG1319, CoxM, Aerobic-type carbon monoxide dehydrogenase, middle subunit CoxM/CutM homologs [Energy production and conversion]	NA|719aa|down_6|NZ_AP022600.1_355444_357601_+	COG1529, CoxL, Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs [Energy production and conversion]	NA|359aa|down_7|NZ_AP022600.1_357578_358655_-	PRK13007, PRK13007, succinyl-diaminopimelate desuccinylase; Reviewed	NA|318aa|down_8|NZ_AP022600.1_358693_359647_+	TIGR03535, DapD_actino, 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase	NA|115aa|down_9|NZ_AP022600.1_359738_360083_-	pfam18029, Glyoxalase_6, Glyoxalase-like domain
GCF_010725885.1_ASM1072588v1	NZ_AP022600	Mycolicibacterium tokaiense strain JCM 6373	2	1890051-1890208	2	CRISPRCasFinder	no		DinG,cas3,casR,DEDDh,csa3,WYL	Orphan	GTGCTCAGCGCGGGCGTGGTGAG	23	0	0	NA	NA	NA	3	3	Orphan	DinG,cas3,casR,DEDDh,csa3,WYL	NA|114aa|up_5|NZ_AP022600.1_1883489_1883831_-,NA|632aa|down_4|NZ_AP022600.1_1895383_1897279_+,NA|146aa|down_8|NZ_AP022600.1_1900600_1901038_+	NA|304aa|up_9|NZ_AP022600.1_1878488_1879400_+	cd08412, PBP2_PAO1_like, The C-terminal substrate-binding domain of putative LysR-type transcriptional regulator PAO1-like, a member of the type 2 periplasmic binding fold protein superfamily	NA|421aa|up_8|NZ_AP022600.1_1879548_1880811_+	cd17369, MFS_ShiA_like, Shikimate transporter and similar proteins of the Major Facilitator Superfamily	NA|413aa|up_7|NZ_AP022600.1_1880838_1882077_+	cd03884, M20_bAS, M20 Peptidase beta-alanine synthase, an amidohydrolase	NA|474aa|up_6|NZ_AP022600.1_1882073_1883495_+	COG0154, GatA, Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit and related amidases [Translation, ribosomal structure and biogenesis]	NA|114aa|up_5|NZ_AP022600.1_1883489_1883831_-	NA	NA|517aa|up_4|NZ_AP022600.1_1883827_1885378_-	COG2220, COG2220, Predicted Zn-dependent hydrolases of the beta-lactamase fold [General function prediction only]	NA|255aa|up_3|NZ_AP022600.1_1885497_1886262_+	COG0204, PlsC, 1-acyl-sn-glycerol-3-phosphate acyltransferase [Lipid metabolism]	NA|246aa|up_2|NZ_AP022600.1_1886268_1887006_+	cd07989, LPLAT_AGPAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: AGPAT-like	NA|275aa|up_1|NZ_AP022600.1_1887002_1887827_+	pfam08282, Hydrolase_3, haloacid dehalogenase-like hydrolase	NA|536aa|up_0|NZ_AP022600.1_1887830_1889438_-	COG5479, COG5479, Uncharacterized protein potentially involved in peptidoglycan biosynthesis [Cell envelope biogenesis, outer membrane]	NA|400aa|down_0|NZ_AP022600.1_1890784_1891984_+	COG0562, Glf, UDP-galactopyranose mutase [Cell envelope biogenesis, outer membrane]	NA|646aa|down_1|NZ_AP022600.1_1891980_1893918_+	pfam17994, Glft2_N, Galactofuranosyltransferase 2 N-terminal	NA|175aa|down_2|NZ_AP022600.1_1893910_1894435_+	cd01610, PAP2_like, PAP2_like proteins, a super-family of histidine phosphatases and vanadium haloperoxidases, includes type 2 phosphatidic acid phosphatase or lipid phosphate phosphatase (LPP), Glucose-6-phosphatase, Phosphatidylglycerophosphatase B and bacterial acid phosphatase, vanadium chloroperoxidases, vanadium bromoperoxidases, and several other mostly uncharacterized subfamilies	NA|306aa|down_3|NZ_AP022600.1_1894431_1895349_+	PRK12324, PRK12324, decaprenyl-phosphate phosphoribosyltransferase	NA|632aa|down_4|NZ_AP022600.1_1895383_1897279_+	NA	NA|305aa|down_5|NZ_AP022600.1_1897457_1898372_+	COG0627, COG0627, Predicted esterase [General function prediction only]	NA|351aa|down_6|NZ_AP022600.1_1898511_1899564_+	COG0627, COG0627, Predicted esterase [General function prediction only]	NA|287aa|down_7|NZ_AP022600.1_1899673_1900534_+	COG0627, COG0627, Predicted esterase [General function prediction only]	NA|146aa|down_8|NZ_AP022600.1_1900600_1901038_+	NA	NA|339aa|down_9|NZ_AP022600.1_1901060_1902077_+	pfam01083, Cutinase, Cutinase
GCF_010725885.1_ASM1072588v1	NZ_AP022600	Mycolicibacterium tokaiense strain JCM 6373	3	3271792-3271897	3	CRISPRCasFinder	no		DinG,cas3,casR,DEDDh,csa3,WYL	Orphan	GACATGTGTGCGGTTTTGCGGGGGACACGCCGC	33	0	0	NA	NA	NA	1	1	Orphan	DinG,cas3,casR,DEDDh,csa3,WYL	NA,NA	NA|394aa|up_9|NZ_AP022600.1_3259779_3260961_+	TIGR03962, mycofact_rSAM, mycofactocin radical SAM maturase	NA|392aa|up_8|NZ_AP022600.1_3260964_3262140_+	TIGR03966, actino_HemFlav, heme/flavin dehydrogenase, mycofactocin system	NA|243aa|up_7|NZ_AP022600.1_3262241_3262970_+	TIGR03964, uncharacterized_protein_putative_amidase, mycofactocin system creatininase family protein	NA|471aa|up_6|NZ_AP022600.1_3262996_3264409_+	TIGR03965, glycosyltransferase_Rv0696_family, mycofactocin system glycosyltransferase	NA|479aa|up_5|NZ_AP022600.1_3264405_3265842_+	TIGR03970, Rv0697, dehydrogenase, Rv0697 family	NA|220aa|up_4|NZ_AP022600.1_3265838_3266498_-	COG4126, COG4126, Hydantoin racemase [Amino acid transport and metabolism]	NA|533aa|up_3|NZ_AP022600.1_3266507_3268106_-	PRK14995, PRK14995, SmvA family efflux MFS transporter	NA|212aa|up_2|NZ_AP022600.1_3268206_3268842_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|421aa|up_1|NZ_AP022600.1_3268807_3270070_-	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|513aa|up_0|NZ_AP022600.1_3270234_3271773_+	pfam00135, COesterase, Carboxylesterase family	NA|102aa|down_0|NZ_AP022600.1_3272275_3272581_+	PRK00596, rpsJ, 30S ribosomal protein S10; Reviewed	NA|218aa|down_1|NZ_AP022600.1_3272595_3273249_+	PRK00001, rplC, 50S ribosomal protein L3; Validated	NA|216aa|down_2|NZ_AP022600.1_3273251_3273899_+	PRK05319, rplD, 50S ribosomal protein L4; Provisional	NA|101aa|down_3|NZ_AP022600.1_3273898_3274201_+	PRK05738, rplW, 50S ribosomal protein L23; Reviewed	NA|279aa|down_4|NZ_AP022600.1_3274222_3275059_+	PRK09374, rplB, 50S ribosomal protein L2; Validated	NA|94aa|down_5|NZ_AP022600.1_3275077_3275359_+	PRK00357, rpsS, 30S ribosomal protein S19; Reviewed	NA|153aa|down_6|NZ_AP022600.1_3275358_3275817_+	PRK00565, rplV, 50S ribosomal protein L22; Reviewed	NA|279aa|down_7|NZ_AP022600.1_3275816_3276653_+	PRK00310, rpsC, 30S ribosomal protein S3; Reviewed	NA|139aa|down_8|NZ_AP022600.1_3276655_3277072_+	PRK09203, rplP, 50S ribosomal protein L16; Reviewed	NA|78aa|down_9|NZ_AP022600.1_3277071_3277305_+	PRK00306, PRK00306, 50S ribosomal protein L29; Reviewed
GCF_010725885.1_ASM1072588v1	NZ_AP022600	Mycolicibacterium tokaiense strain JCM 6373	4	4428825-4428931	4	CRISPRCasFinder	no		DinG,cas3,casR,DEDDh,csa3,WYL	Orphan	GGCACACATGTGTGCTCAGCTCGG	24	0	0	NA	NA	NA	1	1	Orphan	DinG,cas3,casR,DEDDh,csa3,WYL	NA,NA	NA|258aa|up_9|NZ_AP022600.1_4413849_4414623_+	pfam02720, DUF222, Domain of unknown function (DUF222)	NA|279aa|up_8|NZ_AP022600.1_4414635_4415472_-	COG1349, GlpR, Transcriptional regulators of sugar metabolism [Transcription / Carbohydrate transport and metabolism]	NA|310aa|up_7|NZ_AP022600.1_4415630_4416560_+	pfam04087, DUF389, Domain of unknown function (DUF389)	NA|952aa|up_6|NZ_AP022600.1_4419829_4422685_+	COG2308, COG2308, Uncharacterized conserved protein [Function unknown]	NA|304aa|up_5|NZ_AP022600.1_4422681_4423593_+	COG1305, COG1305, Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]	NA|358aa|up_4|NZ_AP022600.1_4423585_4424659_+	COG4307, COG4307, Uncharacterized protein conserved in bacteria [Function unknown]	NA|394aa|up_3|NZ_AP022600.1_4424803_4425985_-	cd08283, FDH_like_1, Glutathione-dependent formaldehyde dehydrogenase related proteins, child 1	NA|196aa|up_2|NZ_AP022600.1_4425994_4426582_-	smart00880, CHAD, The CHAD domain is an alpha-helical domain functionally associated with some members of the adenylate cyclase family	NA|281aa|up_1|NZ_AP022600.1_4426575_4427418_+	pfam11296, DUF3097, Protein of unknown function (DUF3097)	NA|445aa|up_0|NZ_AP022600.1_4427469_4428804_+	PRK13342, PRK13342, recombination factor protein RarA; Reviewed	NA|96aa|down_0|NZ_AP022600.1_4429070_4429358_-	COG2261, COG2261, Predicted membrane protein [Function unknown]	NA|134aa|down_1|NZ_AP022600.1_4429473_4429875_+	COG0432, COG0432, Uncharacterized conserved protein [Function unknown]	NA|897aa|down_2|NZ_AP022600.1_4429961_4432652_+	PRK00252, alaS, alanyl-tRNA synthetase; Reviewed	NA|179aa|down_3|NZ_AP022600.1_4432658_4433195_+	PRK00109, PRK00109, Holliday junction resolvase RuvX	NA|414aa|down_4|NZ_AP022600.1_4433187_4434429_+	COG1559, COG1559, Aminodeoxychorismate lyase [Coenzyme transport and metabolism]	NA|267aa|down_5|NZ_AP022600.1_4434418_4435219_+	PRK00258, aroE, shikimate 5-dehydrogenase; Reviewed	NA|131aa|down_6|NZ_AP022600.1_4435228_4435621_+	pfam01478, Peptidase_A24, Type IV leader peptidase family	NA|466aa|down_7|NZ_AP022600.1_4435621_4437019_-	cd17504, MFS_MMR_MDR_like, Methylenomycin A resistance protein (also called MMR peptide)-like multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|399aa|down_8|NZ_AP022600.1_4437091_4438288_+	PRK05382, PRK05382, chorismate synthase; Validated	NA|229aa|down_9|NZ_AP022600.1_4438305_4438992_+	PRK00131, aroK, shikimate kinase; Reviewed
GCF_010725885.1_ASM1072588v1	NZ_AP022600	Mycolicibacterium tokaiense strain JCM 6373	5	6062942-6063041	5	CRISPRCasFinder	no	csa3	DinG,cas3,casR,DEDDh,csa3,WYL	Type I-A	GTGTGCGGTTTACTCCGCGACACGCCG	27	0	0	NA	NA	NA	1	1	Orphan	DinG,cas3,casR,DEDDh,csa3,WYL	NA|221aa|up_3|NZ_AP022600.1_6060857_6061520_+,NA|104aa|up_0|NZ_AP022600.1_6062470_6062782_+,NA|79aa|down_0|NZ_AP022600.1_6063174_6063411_+,NA|110aa|down_1|NZ_AP022600.1_6063412_6063742_-	NA|682aa|up_9|NZ_AP022600.1_6052595_6054641_-	cd03225, ABC_cobalt_CbiO_domain1, First domain of the ATP-binding cassette component of cobalt transport system	NA|149aa|up_8|NZ_AP022600.1_6054705_6055152_+	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|400aa|up_7|NZ_AP022600.1_6055216_6056416_+	cd17474, MFS_YfmO_like, Bacillus subtilis multidrug efflux protein YfmO and similar transporters of the Major Facilitator Superfamily	NA|352aa|up_6|NZ_AP022600.1_6056948_6058004_-	pfam13449, Phytase-like, Esterase-like activity of phytase	NA|238aa|up_5|NZ_AP022600.1_6058034_6058748_-	pfam01988, VIT1, VIT family	NA|646aa|up_4|NZ_AP022600.1_6058824_6060762_+	cd01150, AXO, Peroxisomal acyl-CoA oxidase	NA|221aa|up_3|NZ_AP022600.1_6060857_6061520_+	NA	NA|190aa|up_2|NZ_AP022600.1_6061516_6062086_+	cd02862, NorE_like, NorE_like subfamily of heme-copper oxidase subunit III	NA|90aa|up_1|NZ_AP022600.1_6062085_6062355_+	pfam03626, COX4_pro, Prokaryotic Cytochrome C oxidase subunit IV	NA|104aa|up_0|NZ_AP022600.1_6062470_6062782_+	NA	NA|79aa|down_0|NZ_AP022600.1_6063174_6063411_+	NA	NA|110aa|down_1|NZ_AP022600.1_6063412_6063742_-	NA	NA|631aa|down_2|NZ_AP022600.1_6063808_6065701_-	PRK05667, dnaG, DNA primase; Validated	NA|421aa|down_3|NZ_AP022600.1_6065705_6066968_-	PRK03007, PRK03007, deoxyguanosinetriphosphate triphosphohydrolase-like protein; Provisional	NA|677aa|down_4|NZ_AP022600.1_6067038_6069069_+	pfam04536, TPM_phosphatase, TPM domain	csa3|125aa|down_5|NZ_AP022600.1_6069259_6069634_+	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|550aa|down_6|NZ_AP022600.1_6069630_6071280_+	COG0659, SUL1, Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism]	NA|462aa|down_7|NZ_AP022600.1_6071266_6072652_-	PRK04173, PRK04173, glycyl-tRNA synthetase; Provisional	csa3|118aa|down_8|NZ_AP022600.1_6072805_6073159_+	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|134aa|down_9|NZ_AP022600.1_6073155_6073557_+	COG0735, Fur, Fe2+/Zn2+ uptake regulation proteins [Inorganic ion transport and metabolism]
GCF_010725885.1_ASM1072588v1	NZ_AP022600	Mycolicibacterium tokaiense strain JCM 6373	6	6157466-6157549	6	CRISPRCasFinder	no		DinG,cas3,casR,DEDDh,csa3,WYL	Orphan	CACCGCCTGCTGTTGTTGCTGCTG	24	0	0	NA	NA	NA	1	1	Orphan	DinG,cas3,casR,DEDDh,csa3,WYL	NA,NA	NA|217aa|up_9|NZ_AP022600.1_6148897_6149548_+	pfam02452, PemK_toxin, PemK-like, MazF-like toxin of type II toxin-antitoxin system	NA|222aa|up_8|NZ_AP022600.1_6149552_6150218_-	TIGR03968, transcriptional_regulator_TetR_family, mycofactocin system transcriptional regulator	NA|384aa|up_7|NZ_AP022600.1_6150289_6151441_+	cd01158, SCAD_SBCAD, Short chain acyl-CoA dehydrogenases and eukaryotic short/branched chain acyl-CoA dehydrogenases	NA|173aa|up_6|NZ_AP022600.1_6151463_6151982_-	COG0456, RimI, Acetyltransferases [General function prediction only]	NA|143aa|up_5|NZ_AP022600.1_6152080_6152509_-	cd04623, CBS_pair_bac_euk, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains present in bacteria and eukaryotes	NA|328aa|up_4|NZ_AP022600.1_6152644_6153628_+	cd19076, AKR_AKR13A_13D, AKR13A and AKR13D families of aldo-keto reductase (AKR)	NA|278aa|up_3|NZ_AP022600.1_6153665_6154499_+	PRK00055, PRK00055, ribonuclease Z; Reviewed	NA|68aa|up_2|NZ_AP022600.1_6155331_6155535_-	COG1278, CspC, Cold shock proteins [Transcription]	NA|280aa|up_1|NZ_AP022600.1_6155642_6156482_-	COG1305, COG1305, Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]	NA|326aa|up_0|NZ_AP022600.1_6156482_6157460_-	COG2307, COG2307, Uncharacterized protein conserved in bacteria [Function unknown]	NA|87aa|down_0|NZ_AP022600.1_6159354_6159615_+	PRK00239, rpsT, 30S ribosomal protein S20; Reviewed	NA|329aa|down_1|NZ_AP022600.1_6159688_6160675_-	PRK07914, PRK07914, hypothetical protein; Reviewed	NA|499aa|down_2|NZ_AP022600.1_6160687_6162184_-	pfam03772, Competence, Competence protein	NA|262aa|down_3|NZ_AP022600.1_6162210_6162996_-	COG1555, ComEA, DNA uptake protein and related DNA-binding proteins [DNA replication, recombination, and repair]	NA|348aa|down_4|NZ_AP022600.1_6163143_6164187_+	COG3804, COG3804, Uncharacterized conserved protein related to dihydrodipicolinate reductase [Function unknown]	NA|280aa|down_5|NZ_AP022600.1_6164176_6165016_-	TIGR00762, DegV, EDD domain protein, DegV family	NA|241aa|down_6|NZ_AP022600.1_6165018_6165741_-	cd00229, SGNH_hydrolase, SGNH_hydrolase, or GDSL_hydrolase, is a diverse family of lipases and esterases	NA|219aa|down_7|NZ_AP022600.1_6165730_6166387_-	pfam00300, His_Phos_1, Histidine phosphatase superfamily (branch 1)	NA|129aa|down_8|NZ_AP022600.1_6166413_6166800_-	COG0799, COG0799, Uncharacterized homolog of plant Iojap protein [Function unknown]	NA|214aa|down_9|NZ_AP022600.1_6166796_6167438_-	PRK00071, nadD, nicotinate-nucleotide adenylyltransferase
