assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_010730355.1_ASM1073035v1	NZ_AP022576	Mycobacterium florentinum strain JCM 14740	1	2238288-2238371	1	CRISPRCasFinder	no	cas3	DEDDh,cas4,WYL,csa3,cas3,DinG	Unclear	AGCCGGGCGACGATGCGCGTCGTCACGA	28	0	0	NA	NA	NA	1	1	Unclear	DEDDh,cas4,WYL,csa3,cas3,DinG	NA,NA	NA|85aa|up_9|NZ_AP022576.1_2225670_2225925_+	TIGR02200, conserved_hypothetical_protein, Glutaredoxin-like protein	NA|221aa|up_8|NZ_AP022576.1_2225949_2226612_+	pfam14032, PknH_C, PknH-like extracellular domain	NA|308aa|up_7|NZ_AP022576.1_2226616_2227540_-	PRK00241, nudC, NAD(+) diphosphatase	NA|357aa|up_6|NZ_AP022576.1_2227539_2228610_-	pfam02254, TrkA_N, TrkA-N domain	NA|1110aa|up_5|NZ_AP022576.1_2228699_2232029_-	COG0210, UvrD, Superfamily I DNA and RNA helicases [DNA replication, recombination, and repair]	NA|1035aa|up_4|NZ_AP022576.1_2232025_2235130_-	COG0210, UvrD, Superfamily I DNA and RNA helicases [DNA replication, recombination, and repair]	NA|262aa|up_3|NZ_AP022576.1_2235210_2235996_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|102aa|up_2|NZ_AP022576.1_2235999_2236305_+	COG3695, COG3695, Predicted methylated DNA-protein cysteine methyltransferase [DNA replication, recombination, and repair]	NA|331aa|up_1|NZ_AP022576.1_2236431_2237424_+	pfam00665, rve, Integrase core domain	NA|285aa|up_0|NZ_AP022576.1_2237420_2238275_-	TIGR02569, conserved_hypothetical_protein, TIGR02569 family protein	NA|398aa|down_0|NZ_AP022576.1_2238383_2239577_-	PRK07878, PRK07878, molybdopterin biosynthesis-like protein MoeZ; Validated	NA|286aa|down_1|NZ_AP022576.1_2239750_2240608_-	pfam11350, DUF3152, Protein of unknown function (DUF3152)	NA|229aa|down_2|NZ_AP022576.1_2240966_2241653_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|90aa|down_3|NZ_AP022576.1_2241639_2241909_-	pfam11305, DUF3107, Protein of unknown function (DUF3107)	NA|281aa|down_4|NZ_AP022576.1_2242036_2242879_+	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|232aa|down_5|NZ_AP022576.1_2242882_2243578_-	pfam13794, MiaE_2, tRNA-(MS[2]IO[6]A)-hydroxylase (MiaE)-like	cas3|502aa|down_6|NZ_AP022576.1_2243830_2245336_+	COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis]	NA|407aa|down_7|NZ_AP022576.1_2245350_2246571_+	TIGR03300, assembly_YfgL, outer membrane assembly lipoprotein YfgL	NA|267aa|down_8|NZ_AP022576.1_2246585_2247386_-	pfam13614, AAA_31, AAA domain	NA|199aa|down_9|NZ_AP022576.1_2247461_2248058_+	PRK13462, PRK13462, acid phosphatase; Provisional
GCF_010730355.1_ASM1073035v1	NZ_AP022576	Mycobacterium florentinum strain JCM 14740	2	3097106-3097226	2	CRISPRCasFinder	no		DEDDh,cas4,WYL,csa3,cas3,DinG	Orphan	CCCGGCCACGGCGCTCCCGGCAGCGGACCAA	31	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas4,WYL,csa3,cas3,DinG	NA|197aa|up_1|NZ_AP022576.1_3094198_3094789_-,NA|125aa|up_0|NZ_AP022576.1_3095077_3095452_-,NA|214aa|down_1|NZ_AP022576.1_3099728_3100370_-,NA|201aa|down_9|NZ_AP022576.1_3108179_3108782_+	NA|424aa|up_9|NZ_AP022576.1_3084942_3086214_+	pfam07907, YibE_F, YibE/F-like protein	NA|289aa|up_8|NZ_AP022576.1_3086164_3087031_-	TIGR01207, Glucose-1-phosphate_thymidylyltransferase_1, glucose-1-phosphate thymidylyltransferase, short form	NA|122aa|up_7|NZ_AP022576.1_3087048_3087414_-	pfam12680, SnoaL_2, SnoaL-like domain	NA|252aa|up_6|NZ_AP022576.1_3087466_3088222_-	TIGR03083, TIGR03083, uncharacterized Actinobacterial protein TIGR03083	NA|442aa|up_5|NZ_AP022576.1_3088251_3089577_-	COG1004, Ugd, Predicted UDP-glucose 6-dehydrogenase [Cell envelope biogenesis, outer membrane]	NA|510aa|up_4|NZ_AP022576.1_3089668_3091198_-	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|705aa|up_3|NZ_AP022576.1_3091367_3093482_-	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|191aa|up_2|NZ_AP022576.1_3093591_3094164_-	PRK00416, dcd, deoxycytidine triphosphate deaminase; Reviewed	NA|197aa|up_1|NZ_AP022576.1_3094198_3094789_-	NA	NA|125aa|up_0|NZ_AP022576.1_3095077_3095452_-	NA	NA|268aa|down_0|NZ_AP022576.1_3097816_3098620_+	TIGR03971, short-chain_dehydrogenase/reductase_SDR, SDR family mycofactocin-dependent oxidoreductase	NA|214aa|down_1|NZ_AP022576.1_3099728_3100370_-	NA	NA|285aa|down_2|NZ_AP022576.1_3100541_3101396_+	COG1376, ErfK, Uncharacterized protein conserved in bacteria [Function unknown]	NA|181aa|down_3|NZ_AP022576.1_3101588_3102131_-	PRK14954, PRK14954, DNA polymerase III subunits gamma and tau; Provisional	NA|271aa|down_4|NZ_AP022576.1_3102318_3103131_-	pfam04240, Caroten_synth, Carotenoid biosynthesis protein	NA|257aa|down_5|NZ_AP022576.1_3103316_3104087_+	pfam12146, Hydrolase_4, Serine aminopeptidase, S33	NA|509aa|down_6|NZ_AP022576.1_3104110_3105637_-	PRK02106, PRK02106, choline dehydrogenase; Validated	NA|242aa|down_7|NZ_AP022576.1_3105739_3106465_+	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|497aa|down_8|NZ_AP022576.1_3106466_3107957_-	COG1597, LCB5, Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase [Lipid metabolism / General function prediction only]	NA|201aa|down_9|NZ_AP022576.1_3108179_3108782_+	NA
GCF_010730355.1_ASM1073035v1	NZ_AP022576	Mycobacterium florentinum strain JCM 14740	3	3115569-3115713	3	CRISPRCasFinder	no		DEDDh,cas4,WYL,csa3,cas3,DinG	Orphan	GGCCCCGGCGGTTTCGGGGGCGGCT	25	1	1	3115594-3115610	NZ_AP022576.1_276506-276490	NA	2	2	Orphan	DEDDh,cas4,WYL,csa3,cas3,DinG	NA|201aa|up_8|NZ_AP022576.1_3108179_3108782_+,NA|102aa|up_5|NZ_AP022576.1_3111009_3111315_-,NA|76aa|up_4|NZ_AP022576.1_3111434_3111662_+,NA|129aa|down_3|NZ_AP022576.1_3121321_3121708_-,NA|410aa|down_5|NZ_AP022576.1_3123903_3125133_-	NA|497aa|up_9|NZ_AP022576.1_3106466_3107957_-	COG1597, LCB5, Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase [Lipid metabolism / General function prediction only]	NA|201aa|up_8|NZ_AP022576.1_3108179_3108782_+	NA	NA|230aa|up_7|NZ_AP022576.1_3108794_3109484_-	pfam11139, SfLAP, Sap, sulfolipid-1-addressing protein	NA|523aa|up_6|NZ_AP022576.1_3109428_3110997_+	PRK09228, PRK09228, guanine deaminase; Provisional	NA|102aa|up_5|NZ_AP022576.1_3111009_3111315_-	NA	NA|76aa|up_4|NZ_AP022576.1_3111434_3111662_+	NA	NA|496aa|up_3|NZ_AP022576.1_3111724_3113212_+	COG5305, COG5305, Predicted membrane protein [Function unknown]	NA|278aa|up_2|NZ_AP022576.1_3113212_3114046_-	cd08023, GH16_laminarinase_like, Laminarinase, member of the glycosyl hydrolase family 16	NA|276aa|up_1|NZ_AP022576.1_3114167_3114995_-	cd08023, GH16_laminarinase_like, Laminarinase, member of the glycosyl hydrolase family 16	NA|140aa|up_0|NZ_AP022576.1_3115007_3115427_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|340aa|down_0|NZ_AP022576.1_3116295_3117315_-	cd05288, PGDH, Prostaglandin dehydrogenases	NA|243aa|down_1|NZ_AP022576.1_3117362_3118091_+	pfam11259, DUF3060, Protein of unknown function (DUF3060)	NA|1058aa|down_2|NZ_AP022576.1_3118095_3121269_-	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|129aa|down_3|NZ_AP022576.1_3121321_3121708_-	NA	NA|650aa|down_4|NZ_AP022576.1_3121799_3123749_-	COG0443, DnaK, Molecular chaperone [Posttranslational modification, protein turnover, chaperones]	NA|410aa|down_5|NZ_AP022576.1_3123903_3125133_-	NA	NA|207aa|down_6|NZ_AP022576.1_3125171_3125792_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|166aa|down_7|NZ_AP022576.1_3125911_3126409_+	pfam13577, SnoaL_4, SnoaL-like domain	NA|222aa|down_8|NZ_AP022576.1_3126412_3127078_-	COG3786, COG3786, Uncharacterized protein conserved in bacteria [Function unknown]	NA|209aa|down_9|NZ_AP022576.1_3127152_3127779_-	cd03392, PAP2_like_2, PAP2_like_2 proteins
GCF_010730355.1_ASM1073035v1	NZ_AP022576	Mycobacterium florentinum strain JCM 14740	4	6115695-6115843	4	CRISPRCasFinder	no		DEDDh,cas4,WYL,csa3,cas3,DinG	Orphan	GGGCCACCGGGCCCGCCGGGACCACCTTGAGGTCCACCTGGGCCGCC	47	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas4,WYL,csa3,cas3,DinG	NA|219aa|up_5|NZ_AP022576.1_6109096_6109753_-,NA|71aa|down_4|NZ_AP022576.1_6121557_6121770_+	NA|602aa|up_9|NZ_AP022576.1_6094449_6096255_+	TIGR03104, trio_amidotrans, asparagine synthase family amidotransferase	NA|595aa|up_8|NZ_AP022576.1_6096251_6098036_+	TIGR03103, trio_acet_GNAT, GNAT-family acetyltransferase TIGR03103	NA|572aa|up_7|NZ_AP022576.1_6098039_6099755_-	cd06534, ALDH-SF, NAD(P)+-dependent aldehyde dehydrogenase superfamily	NA|3044aa|up_6|NZ_AP022576.1_6099871_6109003_-	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|219aa|up_5|NZ_AP022576.1_6109096_6109753_-	NA	NA|308aa|up_4|NZ_AP022576.1_6109944_6110868_+	COG3315, COG3315, O-Methyltransferase involved in polyketide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism]	NA|432aa|up_3|NZ_AP022576.1_6110839_6112135_-	COG1819, COG1819, Glycosyl transferases, related to UDP-glucuronosyltransferase [Carbohydrate transport and metabolism / Signal transduction mechanisms]	NA|257aa|up_2|NZ_AP022576.1_6112423_6113194_-	pfam08241, Methyltransf_11, Methyltransferase domain	NA|211aa|up_1|NZ_AP022576.1_6113848_6114481_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|234aa|up_0|NZ_AP022576.1_6114496_6115198_-	pfam00300, His_Phos_1, Histidine phosphatase superfamily (branch 1)	NA|262aa|down_0|NZ_AP022576.1_6116441_6117227_+	pfam13622, 4HBT_3, Thioesterase-like superfamily	NA|519aa|down_1|NZ_AP022576.1_6117223_6118780_+	PRK13295, PRK13295, cyclohexanecarboxylate-CoA ligase; Reviewed	NA|207aa|down_2|NZ_AP022576.1_6118867_6119488_-	TIGR03085, TIGR03085, TIGR03085 family protein	NA|642aa|down_3|NZ_AP022576.1_6119559_6121485_+	PRK05218, PRK05218, heat shock protein 90; Provisional	NA|71aa|down_4|NZ_AP022576.1_6121557_6121770_+	NA	NA|589aa|down_5|NZ_AP022576.1_6121792_6123559_-	pfam12077, DUF3556, Transmembrane protein of unknown function (DUF3556)	NA|434aa|down_6|NZ_AP022576.1_6123868_6125170_+	cd17371, MFS_MucK, Cis,cis-muconate transport protein and similar proteins of the Major Facilitator Superfamily	NA|227aa|down_7|NZ_AP022576.1_6125123_6125804_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|360aa|down_8|NZ_AP022576.1_6125873_6126953_+	cd04730, NPD_like, 2-Nitropropane dioxygenase (NPD), one of the nitroalkane oxidizing enzyme families, catalyzes oxidative denitrification of nitroalkanes to their corresponding carbonyl compounds and nitrites	NA|128aa|down_9|NZ_AP022576.1_6127088_6127472_+	cd06587, VOC, vicinal oxygen chelate (VOC) family
