assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_009363295.1_ASM936329v1	NZ_CP045325	Mycobacterium sp. THAF192 chromosome, complete genome	1	215746-215878	1	CRISPRCasFinder	no		csa3,WYL,PD-DExK,cas3,c2c9_V-U4,casR,cas4,DEDDh,DinG	Orphan	GCCACGGCGGTGCTCTCGTCGCCG	24	1	1	215770-215793	NZ_CP045325.1_221970-221993	NA	2	2	Orphan	csa3,WYL,PD-DExK,cas3,c2c9_V-U4,casR,cas4,DEDDh,DinG,csf3gr5,csf2gr7,csf4gr11,csf1gr8	NA|164aa|up_9|NZ_CP045325.1_201014_201506_-,NA|455aa|down_5|NZ_CP045325.1_229488_230853_+,NA|77aa|down_7|NZ_CP045325.1_233044_233275_+	NA|164aa|up_9|NZ_CP045325.1_201014_201506_-	NA	NA|976aa|up_8|NZ_CP045325.1_201502_204430_-	COG2409, COG2409, Predicted drug exporters of the RND superfamily [General function prediction only]	NA|182aa|up_7|NZ_CP045325.1_204696_205242_+	TIGR04530, hemophoreRv0203, hemophore, mycobacterial-type	NA|129aa|up_6|NZ_CP045325.1_205431_205818_+	pfam16525, MHB, Haemophore, haem-binding	NA|223aa|up_5|NZ_CP045325.1_205902_206571_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|452aa|up_4|NZ_CP045325.1_206586_207942_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|123aa|up_3|NZ_CP045325.1_208026_208395_-	pfam11255, DUF3054, Protein of unknown function (DUF3054)	NA|417aa|up_2|NZ_CP045325.1_208381_209632_-	pfam01551, Peptidase_M23, Peptidase family M23	NA|389aa|up_1|NZ_CP045325.1_209650_210817_-	COG0392, COG0392, Predicted integral membrane protein [Function unknown]	NA|384aa|up_0|NZ_CP045325.1_210894_212046_+	pfam01594, AI-2E_transport, AI-2E family transporter	NA|1994aa|down_0|NZ_CP045325.1_217606_223588_-	COG3391, COG3391, Uncharacterized conserved protein [Function unknown]	NA|991aa|down_1|NZ_CP045325.1_223781_226754_-	COG2409, COG2409, Predicted drug exporters of the RND superfamily [General function prediction only]	NA|239aa|down_2|NZ_CP045325.1_226824_227541_-	cd10911, PIN_LabA, PIN domain of Synechococcus elongatus LabA (low-amplitude and bright) and related proteins	NA|262aa|down_3|NZ_CP045325.1_227537_228323_-	PRK00121, trmB, tRNA (guanine-N(7)-)-methyltransferase; Reviewed	NA|307aa|down_4|NZ_CP045325.1_228562_229483_+	cd01899, Ygr210, Ygr210 GTPase	NA|455aa|down_5|NZ_CP045325.1_229488_230853_+	NA	NA|610aa|down_6|NZ_CP045325.1_231048_232878_+	PRK04210, PRK04210, phosphoenolpyruvate carboxykinase (GTP)	NA|77aa|down_7|NZ_CP045325.1_233044_233275_+	NA	NA|116aa|down_8|NZ_CP045325.1_233261_233609_+	pfam02452, PemK_toxin, PemK-like, MazF-like toxin of type II toxin-antitoxin system	NA|508aa|down_9|NZ_CP045325.1_233710_235234_+	PRK08276, PRK08276, long-chain-fatty-acid--CoA ligase; Validated
GCF_009363295.1_ASM936329v1	NZ_CP045325	Mycobacterium sp. THAF192 chromosome, complete genome	2	215977-216127	2	CRISPRCasFinder	no		csa3,WYL,PD-DExK,cas3,c2c9_V-U4,casR,cas4,DEDDh,DinG	Orphan	GCCACGGCGGTGCTCTCGTCGCCG	24	1	1	216064-216102	NZ_CP045325.1_222264-222302	NA	2	2	Orphan	csa3,WYL,PD-DExK,cas3,c2c9_V-U4,casR,cas4,DEDDh,DinG,csf3gr5,csf2gr7,csf4gr11,csf1gr8	NA|164aa|up_9|NZ_CP045325.1_201014_201506_-,NA|455aa|down_5|NZ_CP045325.1_229488_230853_+,NA|77aa|down_7|NZ_CP045325.1_233044_233275_+	NA|164aa|up_9|NZ_CP045325.1_201014_201506_-	NA	NA|976aa|up_8|NZ_CP045325.1_201502_204430_-	COG2409, COG2409, Predicted drug exporters of the RND superfamily [General function prediction only]	NA|182aa|up_7|NZ_CP045325.1_204696_205242_+	TIGR04530, hemophoreRv0203, hemophore, mycobacterial-type	NA|129aa|up_6|NZ_CP045325.1_205431_205818_+	pfam16525, MHB, Haemophore, haem-binding	NA|223aa|up_5|NZ_CP045325.1_205902_206571_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|452aa|up_4|NZ_CP045325.1_206586_207942_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|123aa|up_3|NZ_CP045325.1_208026_208395_-	pfam11255, DUF3054, Protein of unknown function (DUF3054)	NA|417aa|up_2|NZ_CP045325.1_208381_209632_-	pfam01551, Peptidase_M23, Peptidase family M23	NA|389aa|up_1|NZ_CP045325.1_209650_210817_-	COG0392, COG0392, Predicted integral membrane protein [Function unknown]	NA|384aa|up_0|NZ_CP045325.1_210894_212046_+	pfam01594, AI-2E_transport, AI-2E family transporter	NA|1994aa|down_0|NZ_CP045325.1_217606_223588_-	COG3391, COG3391, Uncharacterized conserved protein [Function unknown]	NA|991aa|down_1|NZ_CP045325.1_223781_226754_-	COG2409, COG2409, Predicted drug exporters of the RND superfamily [General function prediction only]	NA|239aa|down_2|NZ_CP045325.1_226824_227541_-	cd10911, PIN_LabA, PIN domain of Synechococcus elongatus LabA (low-amplitude and bright) and related proteins	NA|262aa|down_3|NZ_CP045325.1_227537_228323_-	PRK00121, trmB, tRNA (guanine-N(7)-)-methyltransferase; Reviewed	NA|307aa|down_4|NZ_CP045325.1_228562_229483_+	cd01899, Ygr210, Ygr210 GTPase	NA|455aa|down_5|NZ_CP045325.1_229488_230853_+	NA	NA|610aa|down_6|NZ_CP045325.1_231048_232878_+	PRK04210, PRK04210, phosphoenolpyruvate carboxykinase (GTP)	NA|77aa|down_7|NZ_CP045325.1_233044_233275_+	NA	NA|116aa|down_8|NZ_CP045325.1_233261_233609_+	pfam02452, PemK_toxin, PemK-like, MazF-like toxin of type II toxin-antitoxin system	NA|508aa|down_9|NZ_CP045325.1_233710_235234_+	PRK08276, PRK08276, long-chain-fatty-acid--CoA ligase; Validated
GCF_009363295.1_ASM936329v1	NZ_CP045325	Mycobacterium sp. THAF192 chromosome, complete genome	3	216211-216288	3	CRISPRCasFinder	no		csa3,WYL,PD-DExK,cas3,c2c9_V-U4,casR,cas4,DEDDh,DinG	Orphan	GCCACGGCGGTGCTCTCGTCGCCG	24	1	1	216235-216264	NZ_CP045325.1_222435-222464	NA	1	1	Orphan	csa3,WYL,PD-DExK,cas3,c2c9_V-U4,casR,cas4,DEDDh,DinG,csf3gr5,csf2gr7,csf4gr11,csf1gr8	NA|164aa|up_9|NZ_CP045325.1_201014_201506_-,NA|455aa|down_5|NZ_CP045325.1_229488_230853_+,NA|77aa|down_7|NZ_CP045325.1_233044_233275_+	NA|164aa|up_9|NZ_CP045325.1_201014_201506_-	NA	NA|976aa|up_8|NZ_CP045325.1_201502_204430_-	COG2409, COG2409, Predicted drug exporters of the RND superfamily [General function prediction only]	NA|182aa|up_7|NZ_CP045325.1_204696_205242_+	TIGR04530, hemophoreRv0203, hemophore, mycobacterial-type	NA|129aa|up_6|NZ_CP045325.1_205431_205818_+	pfam16525, MHB, Haemophore, haem-binding	NA|223aa|up_5|NZ_CP045325.1_205902_206571_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|452aa|up_4|NZ_CP045325.1_206586_207942_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|123aa|up_3|NZ_CP045325.1_208026_208395_-	pfam11255, DUF3054, Protein of unknown function (DUF3054)	NA|417aa|up_2|NZ_CP045325.1_208381_209632_-	pfam01551, Peptidase_M23, Peptidase family M23	NA|389aa|up_1|NZ_CP045325.1_209650_210817_-	COG0392, COG0392, Predicted integral membrane protein [Function unknown]	NA|384aa|up_0|NZ_CP045325.1_210894_212046_+	pfam01594, AI-2E_transport, AI-2E family transporter	NA|1994aa|down_0|NZ_CP045325.1_217606_223588_-	COG3391, COG3391, Uncharacterized conserved protein [Function unknown]	NA|991aa|down_1|NZ_CP045325.1_223781_226754_-	COG2409, COG2409, Predicted drug exporters of the RND superfamily [General function prediction only]	NA|239aa|down_2|NZ_CP045325.1_226824_227541_-	cd10911, PIN_LabA, PIN domain of Synechococcus elongatus LabA (low-amplitude and bright) and related proteins	NA|262aa|down_3|NZ_CP045325.1_227537_228323_-	PRK00121, trmB, tRNA (guanine-N(7)-)-methyltransferase; Reviewed	NA|307aa|down_4|NZ_CP045325.1_228562_229483_+	cd01899, Ygr210, Ygr210 GTPase	NA|455aa|down_5|NZ_CP045325.1_229488_230853_+	NA	NA|610aa|down_6|NZ_CP045325.1_231048_232878_+	PRK04210, PRK04210, phosphoenolpyruvate carboxykinase (GTP)	NA|77aa|down_7|NZ_CP045325.1_233044_233275_+	NA	NA|116aa|down_8|NZ_CP045325.1_233261_233609_+	pfam02452, PemK_toxin, PemK-like, MazF-like toxin of type II toxin-antitoxin system	NA|508aa|down_9|NZ_CP045325.1_233710_235234_+	PRK08276, PRK08276, long-chain-fatty-acid--CoA ligase; Validated
GCF_009363295.1_ASM936329v1	NZ_CP045325	Mycobacterium sp. THAF192 chromosome, complete genome	4	938237-938353	4	CRISPRCasFinder	no		csa3,WYL,PD-DExK,cas3,c2c9_V-U4,casR,cas4,DEDDh,DinG	Orphan	CTCGCACTTTGCCGCAGCCAACGCAGTCTCAACG	34	0	0	NA	NA	NA	1	1	Orphan	csa3,WYL,PD-DExK,cas3,c2c9_V-U4,casR,cas4,DEDDh,DinG,csf3gr5,csf2gr7,csf4gr11,csf1gr8	NA|118aa|up_6|NZ_CP045325.1_932037_932391_+,NA|298aa|up_3|NZ_CP045325.1_933506_934400_-,NA|296aa|up_2|NZ_CP045325.1_934526_935414_-,NA|776aa|up_1|NZ_CP045325.1_935410_937738_-,NA|90aa|up_0|NZ_CP045325.1_937893_938163_-,NA|101aa|down_5|NZ_CP045325.1_946383_946686_-	NA|335aa|up_9|NZ_CP045325.1_928992_929997_-	TIGR03718, R_switched_Alx, integral membrane protein, TerC family	NA|306aa|up_8|NZ_CP045325.1_930110_931028_-	cd08241, QOR1, Quinone oxidoreductase (QOR)	NA|253aa|up_7|NZ_CP045325.1_931168_931927_-	PRK05557, fabG, 3-ketoacyl-(acyl-carrier-protein) reductase; Validated	NA|118aa|up_6|NZ_CP045325.1_932037_932391_+	NA	NA|170aa|up_5|NZ_CP045325.1_932400_932910_-	COG2203, FhlA, FOG: GAF domain [Signal transduction mechanisms]	NA|187aa|up_4|NZ_CP045325.1_932914_933475_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|298aa|up_3|NZ_CP045325.1_933506_934400_-	NA	NA|296aa|up_2|NZ_CP045325.1_934526_935414_-	NA	NA|776aa|up_1|NZ_CP045325.1_935410_937738_-	NA	NA|90aa|up_0|NZ_CP045325.1_937893_938163_-	NA	NA|1122aa|down_0|NZ_CP045325.1_939136_942502_-	COG4913, COG4913, Uncharacterized protein conserved in bacteria [Function unknown]	NA|233aa|down_1|NZ_CP045325.1_942494_943193_-	pfam13835, DUF4194, Domain of unknown function (DUF4194)	NA|484aa|down_2|NZ_CP045325.1_943252_944704_-	pfam11855, DUF3375, Protein of unknown function (DUF3375)	NA|194aa|down_3|NZ_CP045325.1_944896_945478_-	TIGR03968, transcriptional_regulator_TetR_family, mycofactocin system transcriptional regulator	NA|202aa|down_4|NZ_CP045325.1_945521_946127_-	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|101aa|down_5|NZ_CP045325.1_946383_946686_-	NA	NA|578aa|down_6|NZ_CP045325.1_946705_948439_-	PRK05850, PRK05850, acyl-CoA synthetase; Validated	NA|314aa|down_7|NZ_CP045325.1_948896_949838_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|793aa|down_8|NZ_CP045325.1_949952_952331_+	pfam13449, Phytase-like, Esterase-like activity of phytase	NA|528aa|down_9|NZ_CP045325.1_952601_954185_+	COG0659, SUL1, Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism]
GCF_009363295.1_ASM936329v1	NZ_CP045325	Mycobacterium sp. THAF192 chromosome, complete genome	5	1463339-1463447	5	CRISPRCasFinder	no		csa3,WYL,PD-DExK,cas3,c2c9_V-U4,casR,cas4,DEDDh,DinG	Orphan	GCCGCCGCCGGTGACCGAGGACGTCCCGCCGCCG	34	0	0	NA	NA	NA	1	1	Orphan	csa3,WYL,PD-DExK,cas3,c2c9_V-U4,casR,cas4,DEDDh,DinG,csf3gr5,csf2gr7,csf4gr11,csf1gr8	NA|147aa|up_9|NZ_CP045325.1_1449623_1450064_-,NA|51aa|up_2|NZ_CP045325.1_1458368_1458521_-,NA	NA|147aa|up_9|NZ_CP045325.1_1449623_1450064_-	NA	NA|288aa|up_8|NZ_CP045325.1_1450536_1451400_+	PLN02864, PLN02864, enoyl-CoA hydratase	NA|400aa|up_7|NZ_CP045325.1_1451532_1452732_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|1088aa|up_6|NZ_CP045325.1_1452812_1456076_+	PRK05672, dnaE2, error-prone DNA polymerase; Validated	NA|138aa|up_5|NZ_CP045325.1_1456125_1456539_-	TIGR03667, Rv3369, PPOX class probable F420-dependent enzyme, Rv3369 family	NA|214aa|up_4|NZ_CP045325.1_1456562_1457204_+	cd02062, Nitro_FMN_reductase, nitroreductase family protein	NA|155aa|up_3|NZ_CP045325.1_1457208_1457673_-	COG0219, CspR, Predicted rRNA methylase (SpoU class) [Translation, ribosomal structure and biogenesis]	NA|51aa|up_2|NZ_CP045325.1_1458368_1458521_-	NA	NA|234aa|up_1|NZ_CP045325.1_1458704_1459406_+	COG1802, GntR, Transcriptional regulators [Transcription]	NA|507aa|up_0|NZ_CP045325.1_1459510_1461031_+	cd11555, SLC-NCS1sbd_u1, uncharacterized nucleobase-cation-symport-1 (NCS1) transporter subfamily; solute-binding domain	NA|138aa|down_0|NZ_CP045325.1_1463962_1464376_+	COG2018, COG2018, Uncharacterized distant relative of homeotic protein bithoraxoid [General function prediction only]	NA|124aa|down_1|NZ_CP045325.1_1464383_1464755_+	pfam05331, DUF742, Protein of unknown function (DUF742)	NA|192aa|down_2|NZ_CP045325.1_1464735_1465311_+	COG2229, COG2229, Predicted GTPase [General function prediction only]	NA|185aa|down_3|NZ_CP045325.1_1465315_1465870_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|159aa|down_4|NZ_CP045325.1_1465965_1466442_+	PRK11014, PRK11014, HTH-type transcriptional repressor NsrR	NA|406aa|down_5|NZ_CP045325.1_1466438_1467656_+	PRK13289, PRK13289, NO-inducible flavohemoprotein	NA|159aa|down_6|NZ_CP045325.1_1467652_1468129_-	pfam16859, TetR_C_11, Bacterial transcriptional repressor C-terminal	NA|906aa|down_7|NZ_CP045325.1_1468474_1471192_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|132aa|down_8|NZ_CP045325.1_1471244_1471640_-	pfam12680, SnoaL_2, SnoaL-like domain	NA|189aa|down_9|NZ_CP045325.1_1471689_1472256_+	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family
GCF_009363295.1_ASM936329v1	NZ_CP045325	Mycobacterium sp. THAF192 chromosome, complete genome	6	5750113-5750214	6	CRISPRCasFinder	no		csa3,WYL,PD-DExK,cas3,c2c9_V-U4,casR,cas4,DEDDh,DinG	Orphan	ACGCCCACCTGCAGCCCACGTCGCTCTTCACCTGC	35	0	0	NA	NA	NA	1	1	Orphan	csa3,WYL,PD-DExK,cas3,c2c9_V-U4,casR,cas4,DEDDh,DinG,csf3gr5,csf2gr7,csf4gr11,csf1gr8	NA,NA|129aa|down_5|NZ_CP045325.1_5757028_5757415_+,NA|155aa|down_9|NZ_CP045325.1_5760424_5760889_-	NA|541aa|up_9|NZ_CP045325.1_5736425_5738048_-	COG5650, COG5650, Predicted integral membrane protein [Function unknown]	NA|819aa|up_8|NZ_CP045325.1_5738185_5740642_-	COG0744, MrcB, Membrane carboxypeptidase (penicillin-binding protein) [Cell envelope biogenesis, outer membrane]	NA|142aa|up_7|NZ_CP045325.1_5740700_5741126_-	pfam17249, DUF5318, Family of unknown function (DUF5318)	NA|289aa|up_6|NZ_CP045325.1_5741249_5742116_+	pfam08044, DUF1707, Domain of unknown function (DUF1707)	NA|182aa|up_5|NZ_CP045325.1_5742241_5742787_+	pfam03551, PadR, Transcriptional regulator PadR-like family	NA|364aa|up_4|NZ_CP045325.1_5742875_5743967_+	TIGR03450, mycothiol_INO1, inositol 1-phosphate synthase, Actinobacterial type	NA|412aa|up_3|NZ_CP045325.1_5744051_5745287_-	PRK01346, PRK01346, enhanced intracellular survival protein Eis	NA|299aa|up_2|NZ_CP045325.1_5745317_5746214_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|695aa|up_1|NZ_CP045325.1_5746345_5748430_+	COG3211, PhoX, Predicted phosphatase [General function prediction only]	NA|270aa|up_0|NZ_CP045325.1_5748567_5749377_+	TIGR03856, F420_MSMEG_2906, probable F420-dependent oxidoreductase, MSMEG_2906 family	NA|147aa|down_0|NZ_CP045325.1_5750338_5750779_+	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|447aa|down_1|NZ_CP045325.1_5750796_5752137_-	COG3408, GDB1, Glycogen debranching enzyme [Carbohydrate transport and metabolism]	NA|218aa|down_2|NZ_CP045325.1_5752204_5752858_+	PRK08219, PRK08219, SDR family oxidoreductase	NA|959aa|down_3|NZ_CP045325.1_5752909_5755786_-	PRK00390, leuS, leucyl-tRNA synthetase; Validated	NA|322aa|down_4|NZ_CP045325.1_5755993_5756959_+	pfam10738, Lpp-LpqN, Probable lipoprotein LpqN	NA|129aa|down_5|NZ_CP045325.1_5757028_5757415_+	NA	NA|202aa|down_6|NZ_CP045325.1_5757411_5758017_-	PRK00228, PRK00228, YqgE/AlgH family protein	NA|428aa|down_7|NZ_CP045325.1_5758178_5759462_+	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|262aa|down_8|NZ_CP045325.1_5759472_5760258_+	TIGR03084, conserved_hypothetical_protein, TIGR03084 family protein	NA|155aa|down_9|NZ_CP045325.1_5760424_5760889_-	NA
