assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000698475.1_ASM69847v1	NZ_CP007803	Mycobacterium tuberculosis K chromosome, complete genome	1	360593-361291	1	CRISPRCasFinder	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Orphan	TTCGCGAAGCCGATGTTGTAGCTGCCGGTGTTG	33	2	2	360881-360922|360956-360988	NZ_CP007803.1_368897-368938|NZ_CP007803.1_367712-367744	NA	10	10	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA|101aa|up_4|NZ_CP007803.1_357604_357907_+,NA|161aa|down_1|NZ_CP007803.1_370686_371169_-,NA|410aa|down_5|NZ_CP007803.1_373285_374515_+	NA|262aa|up_9|NZ_CP007803.1_352299_353085_+	PRK14103, PRK14103, trans-aconitate 2-methyltransferase; Provisional	NA|268aa|up_8|NZ_CP007803.1_353073_353877_-	COG4424, COG4424, Uncharacterized protein conserved in bacteria [Function unknown]	NA|466aa|up_7|NZ_CP007803.1_353886_355284_-	cd16027, SGSH, N-sulfoglucosamine sulfohydrolase (SGSH; sulfamidase)	NA|592aa|up_6|NZ_CP007803.1_355462_357238_+	pfam00934, PE, PE family	NA|76aa|up_5|NZ_CP007803.1_357380_357608_+	pfam01402, RHH_1, Ribbon-helix-helix protein, copG family	NA|101aa|up_4|NZ_CP007803.1_357604_357907_+	NA	NA|74aa|up_3|NZ_CP007803.1_357954_358176_+	PHA01748, PHA01748, hypothetical protein	NA|142aa|up_2|NZ_CP007803.1_358172_358598_+	cd18755, PIN_MtVapC3_VapC21-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC3, VapC21 and related proteins	NA|215aa|up_1|NZ_CP007803.1_358722_359367_+	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|303aa|up_0|NZ_CP007803.1_359363_360272_+	cd09810, LPOR_like_SDR_c_like, light-dependent protochlorophyllide reductase (LPOR)-like, classical (c)-like SDRs	NA|224aa|down_0|NZ_CP007803.1_370027_370699_+	TIGR02476, BluB, 5,6-dimethylbenzimidazole synthase	NA|161aa|down_1|NZ_CP007803.1_370686_371169_-	NA	NA|239aa|down_2|NZ_CP007803.1_371226_371943_+	cd03392, PAP2_like_2, PAP2_like_2 proteins	NA|219aa|down_3|NZ_CP007803.1_372044_372701_+	COG3786, COG3786, Uncharacterized protein conserved in bacteria [Function unknown]	NA|164aa|down_4|NZ_CP007803.1_372770_373262_-	pfam13577, SnoaL_4, SnoaL-like domain	NA|410aa|down_5|NZ_CP007803.1_373285_374515_+	NA	NA|621aa|down_6|NZ_CP007803.1_374669_376532_+	COG0443, DnaK, Molecular chaperone [Posttranslational modification, protein turnover, chaperones]	NA|129aa|down_7|NZ_CP007803.1_376603_376990_+	COG0326, HtpG, Molecular chaperone, HSP90 family [Posttranslational modification, protein turnover, chaperones]	NA|212aa|down_8|NZ_CP007803.1_376992_377628_-	pfam11259, DUF3060, Protein of unknown function (DUF3060)	NA|295aa|down_9|NZ_CP007803.1_377715_378600_+	COG2273, SKN1, Beta-glucanase/Beta-glucan synthetase [Carbohydrate transport and metabolism]
GCF_000698475.1_ASM69847v1	NZ_CP007803	Mycobacterium tuberculosis K chromosome, complete genome	2	685925-686001	2	CRISPRCasFinder	no	c2c9_V-U4	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Type V-U4	TGAGGTGCGGCGTGAGCGCGGGT	23	0	0	NA	NA	NA	1	1	TypeV-U4	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA,NA	NA|136aa|up_9|NZ_CP007803.1_671820_672228_+	cd18696, PIN_MtVapC26-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC26 and related proteins	NA|229aa|up_8|NZ_CP007803.1_672287_672974_-	pfam10738, Lpp-LpqN, Probable lipoprotein LpqN	NA|878aa|up_7|NZ_CP007803.1_673127_675761_+	COG3537, COG3537, Putative alpha-1,2-mannosidase [Carbohydrate transport and metabolism]	NA|796aa|up_6|NZ_CP007803.1_675783_678171_-	pfam03706, LPG_synthase_TM, Lysylphosphatidylglycerol synthase TM region	NA|241aa|up_5|NZ_CP007803.1_678308_679031_+	COG2186, FadR, Transcriptional regulators [Transcription]	NA|266aa|up_4|NZ_CP007803.1_679027_679825_+	COG0767, Ttg2B, ABC-type transport system involved in resistance to organic solvents, permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|296aa|up_3|NZ_CP007803.1_679826_680714_+	COG0767, Ttg2B, ABC-type transport system involved in resistance to organic solvents, permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|405aa|up_2|NZ_CP007803.1_680719_681934_+	pfam11887, Mce4_CUP1, Cholesterol uptake porter CUP1 of Mce4, putative	NA|344aa|up_1|NZ_CP007803.1_681930_682962_+	COG1463, Ttg2C, ABC-type transport system involved in resistance to organic solvents, periplasmic component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|482aa|up_0|NZ_CP007803.1_682958_684404_+	TIGR00996, Mtu_fam_mce, virulence factor Mce family protein	NA|517aa|down_0|NZ_CP007803.1_687136_688687_+	COG1463, Ttg2C, ABC-type transport system involved in resistance to organic solvents, periplasmic component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|131aa|down_1|NZ_CP007803.1_688738_689131_-	cd18768, PIN_MtVapC4-C5-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC4, VapC5, and related proteins	NA|86aa|down_2|NZ_CP007803.1_689127_689385_-	COG4118, Phd, Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]	NA|412aa|down_3|NZ_CP007803.1_689567_690803_-	COG1373, COG1373, Predicted ATPase (AAA+ superfamily) [General function prediction only]	NA|138aa|down_4|NZ_CP007803.1_691053_691467_-	cd18681, PIN_MtVapC27-VapC40_like, VapC-like PIN domain of Mycobacterium tuberculosis VapC27, and VapC40, and related proteins	NA|79aa|down_5|NZ_CP007803.1_691463_691700_-	COG2002, AbrB, Regulators of stationary/sporulation gene expression [Transcription]	NA|169aa|down_6|NZ_CP007803.1_691803_692310_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|157aa|down_7|NZ_CP007803.1_692423_692894_-	PRK10755, PRK10755, two-component system sensor histidine kinase PmrB	NA|254aa|down_8|NZ_CP007803.1_692937_693699_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|104aa|down_9|NZ_CP007803.1_693755_694067_+	pfam03413, PepSY, Peptidase propeptide and YPEB domain
GCF_000698475.1_ASM69847v1	NZ_CP007803	Mycobacterium tuberculosis K chromosome, complete genome	3	1181040-1181148	3	CRISPRCasFinder	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Orphan	GACAGCCAGCCGGCTGACCCGCCGT	25	0	0	NA	NA	NA	1	1	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA,NA	NA|544aa|up_9|NZ_CP007803.1_1170419_1172051_+	cd12119, ttLC_FACS_AlkK_like, Fatty acyl-CoA synthetases similar to LC-FACS from Thermus thermophiles	NA|339aa|up_8|NZ_CP007803.1_1172126_1173143_+	COG3804, COG3804, Uncharacterized conserved protein related to dihydrodipicolinate reductase [Function unknown]	NA|158aa|up_7|NZ_CP007803.1_1173195_1173669_+	cd07821, PYR_PYL_RCAR_like, Pyrabactin resistance 1 (PYR1), PYR1-like (PYL), regulatory component of abscisic acid receptors (RCARs), and related proteins	NA|288aa|up_6|NZ_CP007803.1_1173702_1174566_+	cd01908, YafJ, Glutamine amidotransferases class-II (Gn-AT)_YafJ-type	NA|286aa|up_5|NZ_CP007803.1_1174570_1175428_+	COG1752, RssA, Predicted esterase of the alpha-beta hydrolase superfamily [General function prediction only]	NA|361aa|up_4|NZ_CP007803.1_1175428_1176511_-	cd07228, Pat_NTE_like_bacteria, Bacterial patatin-like phospholipase domain containing protein 6	NA|140aa|up_3|NZ_CP007803.1_1176591_1177011_-	pfam17301, LpqV, Putative lipoprotein LpqV	NA|189aa|up_2|NZ_CP007803.1_1177122_1177689_+	cd10548, cupin_CDO, cysteine dioxygenase, cupin domain	NA|132aa|up_1|NZ_CP007803.1_1177685_1178081_+	cd01447, Polysulfide_ST, Polysulfide-sulfurtransferase - Rhodanese Homology Domain	NA|680aa|up_0|NZ_CP007803.1_1178108_1180148_-	pfam00934, PE, PE family	NA|588aa|down_0|NZ_CP007803.1_1182425_1184189_-	COG4425, COG4425, Predicted membrane protein [Function unknown]	NA|258aa|down_1|NZ_CP007803.1_1184185_1184959_-	PRK05862, PRK05862, enoyl-CoA hydratase; Provisional	NA|346aa|down_2|NZ_CP007803.1_1184970_1186008_-	PRK05617, PRK05617, 3-hydroxyisobutyryl-CoA hydrolase; Provisional	NA|279aa|down_3|NZ_CP007803.1_1186194_1187031_+	COG4760, COG4760, Predicted membrane protein [Function unknown]	NA|284aa|down_4|NZ_CP007803.1_1187146_1187998_+	pfam18741, MTES_1575, REase_MTES_1575	NA|406aa|down_5|NZ_CP007803.1_1188071_1189289_-	PRK07851, PRK07851, acetyl-CoA C-acetyltransferase	NA|315aa|down_6|NZ_CP007803.1_1189341_1190286_-	cd01836, FeeA_FeeB_like, SGNH_hydrolase subfamily, FeeA, FeeB and similar esterases/lipases	NA|298aa|down_7|NZ_CP007803.1_1190682_1191576_+	COG0657, Aes, Esterase/lipase [Lipid metabolism]	NA|465aa|down_8|NZ_CP007803.1_1191632_1193027_+	TIGR01137, Cystathionine_beta-synthase, cystathionine beta-synthase	NA|241aa|down_9|NZ_CP007803.1_1193228_1193951_+	pfam06271, RDD, RDD family
GCF_000698475.1_ASM69847v1	NZ_CP007803	Mycobacterium tuberculosis K chromosome, complete genome	4	1562151-1563223	4	CRISPRCasFinder	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Orphan	CCGCCGTTGCCGCCGGCACCGCCGTCGCCGAT	32	1	1	1562351-1562372	NZ_CP007803.1_1201979-1201958	NA	15	15	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA|191aa|up_7|NZ_CP007803.1_1553659_1554232_+,NA	NA|103aa|up_9|NZ_CP007803.1_1551429_1551738_+	pfam00934, PE, PE family	NA|540aa|up_8|NZ_CP007803.1_1551734_1553354_+	COG5651, COG5651, PPE-repeat proteins [Cell motility and secretion]	NA|191aa|up_7|NZ_CP007803.1_1553659_1554232_+	NA	NA|209aa|up_6|NZ_CP007803.1_1554366_1554993_+	PRK00300, gmk, guanylate kinase; Provisional	NA|111aa|up_5|NZ_CP007803.1_1555058_1555391_+	TIGR00690, DNA-directed_RNA_polymerase_subunit_omega, DNA-directed RNA polymerase, omega subunit	NA|419aa|up_4|NZ_CP007803.1_1555406_1556663_+	PRK05579, PRK05579, bifunctional phosphopantothenoylcysteine decarboxylase/phosphopantothenate synthase; Validated	NA|404aa|up_3|NZ_CP007803.1_1556790_1558002_+	PRK05250, PRK05250, S-adenosylmethionine synthetase; Validated	NA|493aa|up_2|NZ_CP007803.1_1558074_1559553_-	COG2072, TrkA, Predicted flavoprotein involved in K+ transport [Inorganic ion transport and metabolism]	NA|462aa|up_1|NZ_CP007803.1_1559549_1560935_-	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|345aa|up_0|NZ_CP007803.1_1561012_1562047_+	smart00342, HTH_ARAC, helix_turn_helix, arabinose operon control protein	NA|134aa|down_0|NZ_CP007803.1_1564077_1564479_-	cd18741, PIN_VapC4-5_FitB-like, uncharacterized subgroup of the PIN_VapC4-5_FitB-like subfamily of the PIN domain superfamily	NA|86aa|down_1|NZ_CP007803.1_1564475_1564733_-	pfam01402, RHH_1, Ribbon-helix-helix protein, copG family	NA|320aa|down_2|NZ_CP007803.1_1564815_1565775_-	pfam07859, Abhydrolase_3, alpha/beta hydrolase fold	NA|321aa|down_3|NZ_CP007803.1_1565799_1566762_-	pfam07859, Abhydrolase_3, alpha/beta hydrolase fold	NA|201aa|down_4|NZ_CP007803.1_1566895_1567498_+	COG3714, COG3714, Predicted membrane protein [Function unknown]	NA|656aa|down_5|NZ_CP007803.1_1567578_1569546_+	PRK14873, PRK14873, primosomal protein N'	NA|275aa|down_6|NZ_CP007803.1_1569563_1570388_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|161aa|down_7|NZ_CP007803.1_1570556_1571039_+	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|275aa|down_8|NZ_CP007803.1_1571110_1571935_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|313aa|down_9|NZ_CP007803.1_1572131_1573070_+	PRK00005, fmt, methionyl-tRNA formyltransferase; Reviewed
GCF_000698475.1_ASM69847v1	NZ_CP007803	Mycobacterium tuberculosis K chromosome, complete genome	5	2059419-2059673	1	CRT	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Orphan	GCCNCCGTCGCCGCCNNTGCC	21	2	3	2059500-2059517|2059500-2059517|2059590-2059607	NZ_CP007803.1_395193-395176|NZ_CP007803.1_601253-601236|NZ_CP007803.1_3427120-3427103	NA	5	5	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA|88aa|up_0|NZ_CP007803.1_2058778_2059042_-,NA	NA|165aa|up_9|NZ_CP007803.1_2045128_2045623_+	pfam02577, DNase-RNase, Bifunctional nuclease	NA|226aa|up_8|NZ_CP007803.1_2045970_2046648_+	cd01105, HTH_GlnR-like, Helix-Turn-Helix DNA binding domain of GlnR-like transcription regulators	NA|942aa|up_7|NZ_CP007803.1_2047006_2049832_+	PRK05367, PRK05367, aminomethyl-transferring glycine dehydrogenase	NA|287aa|up_6|NZ_CP007803.1_2050058_2050919_-	PRK03204, PRK03204, haloalkane dehalogenase; Provisional	NA|289aa|up_5|NZ_CP007803.1_2050959_2051826_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|629aa|up_4|NZ_CP007803.1_2051830_2053717_-	TIGR00976, Hypothetical_protein_Rv1835c/MT1883/Mb1866c	NA|678aa|up_3|NZ_CP007803.1_2053732_2055766_-	cd01456, vWA_ywmD_type, VWA ywmD type:Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)	NA|742aa|up_2|NZ_CP007803.1_2055885_2058111_-	PRK02999, PRK02999, malate synthase G; Provisional	NA|132aa|up_1|NZ_CP007803.1_2058386_2058782_-	COG1848, COG1848, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|88aa|up_0|NZ_CP007803.1_2058778_2059042_-	NA	NA|346aa|down_0|NZ_CP007803.1_2060810_2061848_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|456aa|down_1|NZ_CP007803.1_2061847_2063215_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|480aa|down_2|NZ_CP007803.1_2063388_2064828_-	PRK07807, PRK07807, GuaB1 family IMP dehydrogenase-related protein	NA|484aa|down_3|NZ_CP007803.1_2064860_2066312_-	PRK09287, PRK09287, NADP-dependent phosphogluconate dehydrogenase	NA|317aa|down_4|NZ_CP007803.1_2066341_2067292_-	cd07326, M56_BlaR1_MecR1_like, Peptidase M56-like including those in BlaR1 and MecR1, integral membrane metallopeptidase	NA|139aa|down_5|NZ_CP007803.1_2067306_2067723_-	COG3682, COG3682, Predicted transcriptional regulator [Transcription]	NA|141aa|down_6|NZ_CP007803.1_2068000_2068423_+	cd03443, PaaI_thioesterase, PaaI_thioesterase is a tetrameric acyl-CoA thioesterase with a hot dog fold and one of several proteins responsible for phenylacetic acid (PA) degradation in bacteria	NA|101aa|down_7|NZ_CP007803.1_2068471_2068774_+	pfam00547, Urease_gamma, Urease, gamma subunit	NA|105aa|down_8|NZ_CP007803.1_2068770_2069085_+	PRK13202, ureB, urease subunit beta; Reviewed	NA|578aa|down_9|NZ_CP007803.1_2069084_2070818_+	PRK13206, ureC, urease subunit alpha; Reviewed
GCF_000698475.1_ASM69847v1	NZ_CP007803	Mycobacterium tuberculosis K chromosome, complete genome	6	2141359-2141578	5	CRISPRCasFinder	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Orphan	GTGCCAGCCGGAATCGTGATCGGCGGAACCGTCACCGACGGAATACTCA	49	0	0	NA	NA	NA	1	1	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA|136aa|up_2|NZ_CP007803.1_2129897_2130305_-,NA|127aa|down_5|NZ_CP007803.1_2149094_2149475_-	NA|155aa|up_9|NZ_CP007803.1_2123225_2123690_-	pfam14081, DUF4262, Domain of unknown function (DUF4262)	NA|741aa|up_8|NZ_CP007803.1_2123865_2126088_-	PRK15061, PRK15061, catalase/peroxidase	NA|148aa|up_7|NZ_CP007803.1_2126125_2126569_-	COG0735, Fur, Fe2+/Zn2+ uptake regulation proteins [Inorganic ion transport and metabolism]	NA|198aa|up_6|NZ_CP007803.1_2126682_2127276_-	COG1881, COG1881, Phospholipid-binding protein [General function prediction only]	NA|202aa|up_5|NZ_CP007803.1_2127358_2127964_-	COG1881, COG1881, Phospholipid-binding protein [General function prediction only]	NA|335aa|up_4|NZ_CP007803.1_2128063_2129068_-	cd08275, MDR3, Medium chain dehydrogenases/reductase (MDR)/zinc-dependent alcohol dehydrogenase-like family	NA|251aa|up_3|NZ_CP007803.1_2129167_2129920_+	cd16282, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	NA|136aa|up_2|NZ_CP007803.1_2129897_2130305_-	NA	NA|767aa|up_1|NZ_CP007803.1_2130439_2132740_+	PLN02892, PLN02892, isocitrate lyase	NA|421aa|up_0|NZ_CP007803.1_2136935_2138197_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|155aa|down_0|NZ_CP007803.1_2143605_2144070_-	cd07821, PYR_PYL_RCAR_like, Pyrabactin resistance 1 (PYR1), PYR1-like (PYL), regulatory component of abscisic acid receptors (RCARs), and related proteins	NA|288aa|down_1|NZ_CP007803.1_2144167_2145031_+	cd07987, LPLAT_MGAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: MGAT-like	NA|424aa|down_2|NZ_CP007803.1_2145068_2146340_-	TIGR01490, Uncharacterized_protein_Rv3661/MT3761, HAD-superfamily subfamily IB hydrolase, TIGR01490	NA|372aa|down_3|NZ_CP007803.1_2146611_2147727_+	COG1680, AmpC, Beta-lactamase class C and other penicillin binding proteins [Defense mechanisms]	NA|447aa|down_4|NZ_CP007803.1_2147717_2149058_+	pfam00144, Beta-lactamase, Beta-lactamase	NA|127aa|down_5|NZ_CP007803.1_2149094_2149475_-	NA	NA|621aa|down_6|NZ_CP007803.1_2149631_2151494_+	PRK12476, PRK12476, putative fatty-acid--CoA ligase; Provisional	NA|160aa|down_7|NZ_CP007803.1_2151501_2151981_-	pfam09167, DUF1942, Domain of unknown function (DUF1942)	NA|258aa|down_8|NZ_CP007803.1_2152217_2152991_+	COG3361, COG3361, Uncharacterized conserved protein [Function unknown]	NA|256aa|down_9|NZ_CP007803.1_2152994_2153762_-	PRK05867, PRK05867, SDR family oxidoreductase
GCF_000698475.1_ASM69847v1	NZ_CP007803	Mycobacterium tuberculosis K chromosome, complete genome	7	3100544-3101691	1,6,2	PILER-CR,CRISPRCasFinder,CRT	no	c2c9_V-U4,csm3gr7,csm2gr11,cas10,cas6	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Type III-D,Type III-C,Type III-A,Type III-B	GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC,GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC,GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC	36,36,36	0	0	NA	NA	II-B,III-A:II-B,III-A:II-B,III-A	12,14,15	15	TypeIII-D,TypeIII-C,TypeIII-A,TypeIII-B	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA|85aa|up_8|NZ_CP007803.1_3093823_3094078_-,NA|135aa|up_7|NZ_CP007803.1_3094225_3094630_+,NA|64aa|up_6|NZ_CP007803.1_3094626_3094818_+,NA|86aa|up_4|NZ_CP007803.1_3096404_3096662_+,NA|104aa|up_3|NZ_CP007803.1_3096766_3097078_+,NA|203aa|up_2|NZ_CP007803.1_3097497_3098106_+,NA	NA|92aa|up_9|NZ_CP007803.1_3093372_3093648_+	COG4453, COG4453, Uncharacterized protein conserved in bacteria [Function unknown]	NA|85aa|up_8|NZ_CP007803.1_3093823_3094078_-	NA	NA|135aa|up_7|NZ_CP007803.1_3094225_3094630_+	NA	NA|64aa|up_6|NZ_CP007803.1_3094626_3094818_+	NA	NA|385aa|up_5|NZ_CP007803.1_3095016_3096171_+	pfam00665, rve, Integrase core domain	NA|86aa|up_4|NZ_CP007803.1_3096404_3096662_+	NA	NA|104aa|up_3|NZ_CP007803.1_3096766_3097078_+	NA	NA|203aa|up_2|NZ_CP007803.1_3097497_3098106_+	NA	NA|470aa|up_1|NZ_CP007803.1_3098176_3099586_+	pfam00665, rve, Integrase core domain	NA|271aa|up_0|NZ_CP007803.1_3099582_3100395_+	COG3267, ExeA, Type II secretory pathway, component ExeA (predicted ATPase) [Intracellular trafficking and secretion]	NA|421aa|down_0|NZ_CP007803.1_3101722_3102984_+	PHA02517, PHA02517, putative transposase OrfB; Reviewed	csm3gr7|237aa|down_1|NZ_CP007803.1_3103347_3104058_-	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm2gr11|125aa|down_2|NZ_CP007803.1_3104067_3104442_-	TIGR01870, CRISPR_type_III-associated_protein_Csm2, CRISPR type III-A/MTUBE-associated protein Csm2	cas10|813aa|down_3|NZ_CP007803.1_3104438_3106877_-	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	cas6|241aa|down_4|NZ_CP007803.1_3106873_3107596_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|182aa|down_5|NZ_CP007803.1_3107995_3108541_-	COG4293, COG4293, Uncharacterized protein conserved in bacteria [Function unknown]	NA|295aa|down_6|NZ_CP007803.1_3108812_3109697_-	COG2253, COG2253, Uncharacterized conserved protein [Function unknown]	NA|296aa|down_7|NZ_CP007803.1_3109699_3110587_-	pfam09407, AbiEi_1, AbiEi antitoxin C-terminal domain	NA|182aa|down_8|NZ_CP007803.1_3110891_3111437_-	COG4293, COG4293, Uncharacterized protein conserved in bacteria [Function unknown]	NA|90aa|down_9|NZ_CP007803.1_3111433_3111703_-	COG5552, COG5552, Uncharacterized conserved protein [Function unknown]
GCF_000698475.1_ASM69847v1	NZ_CP007803	Mycobacterium tuberculosis K chromosome, complete genome	8	3641373-3641491	7	CRISPRCasFinder	no	cas3	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Unclear	GCCCCTGTGAGTCGAGTGAGCGGAACGAAC	30	1	1	3641403-3641461	NZ_CP007803.1_3641466-3641524	NA	1	1	Unclear	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA,NA|138aa|down_6|NZ_CP007803.1_3647124_3647538_-,NA|126aa|down_7|NZ_CP007803.1_3647572_3647950_-	NA|719aa|up_9|NZ_CP007803.1_3628199_3630356_+	COG2217, ZntA, Cation transport ATPase [Inorganic ion transport and metabolism]	NA|223aa|up_8|NZ_CP007803.1_3630352_3631021_-	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|395aa|up_7|NZ_CP007803.1_3631121_3632306_+	pfam02515, CoA_transf_3, CoA-transferase family III	NA|765aa|up_6|NZ_CP007803.1_3632310_3634605_+	COG0659, SUL1, Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism]	NA|390aa|up_5|NZ_CP007803.1_3634593_3635763_-	cd01158, SCAD_SBCAD, Short chain acyl-CoA dehydrogenases and eukaryotic short/branched chain acyl-CoA dehydrogenases	NA|175aa|up_4|NZ_CP007803.1_3635787_3636312_-	PLN02948, PLN02948, phosphoribosylaminoimidazole carboxylase	NA|430aa|up_3|NZ_CP007803.1_3636308_3637598_-	TIGR01161, N5-carboxyaminoimidazole_ribonucleotide_synthase, phosphoribosylaminoimidazole carboxylase, PurK protein	NA|273aa|up_2|NZ_CP007803.1_3637551_3638370_+	COG2246, COG2246, Predicted membrane protein [Function unknown]	NA|173aa|up_1|NZ_CP007803.1_3638324_3638843_-	pfam03703, bPH_2, Bacterial PH domain	NA|267aa|up_0|NZ_CP007803.1_3638885_3639686_-	COG0340, BirA, Biotin-(acetyl-CoA carboxylase) ligase [Coenzyme metabolism]	NA|223aa|down_0|NZ_CP007803.1_3641757_3642426_+	PRK00148, PRK00148, Maf-like protein; Reviewed	NA|298aa|down_1|NZ_CP007803.1_3642466_3643360_+	COG2897, SseA, Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism]	NA|144aa|down_2|NZ_CP007803.1_3643356_3643788_+	COG2166, sufE, Cysteine desulfurase SufE subunit [Posttranslational modification, protein turnover, chaperones]	NA|601aa|down_3|NZ_CP007803.1_3643895_3645698_+	COG4770, COG4770, Acetyl/propionyl-CoA carboxylase, alpha subunit [Lipid metabolism]	NA|262aa|down_4|NZ_CP007803.1_3645707_3646493_-	PRK07122, PRK07122, RNA polymerase sigma factor SigF; Reviewed	NA|146aa|down_5|NZ_CP007803.1_3646489_3646927_-	COG2172, RsbW, Anti-sigma regulatory factor (Ser/Thr protein kinase) [Signal transduction mechanisms]	NA|138aa|down_6|NZ_CP007803.1_3647124_3647538_-	NA	NA|126aa|down_7|NZ_CP007803.1_3647572_3647950_-	NA	NA|450aa|down_8|NZ_CP007803.1_3647983_3649333_-	PRK08297, PRK08297, L-lysine aminotransferase; Provisional	NA|151aa|down_9|NZ_CP007803.1_3649383_3649836_-	smart00344, HTH_ASNC, helix_turn_helix ASNC type
GCF_000698475.1_ASM69847v1	NZ_CP007803	Mycobacterium tuberculosis K chromosome, complete genome	9	3922228-3923233	3	CRT	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Orphan	CNNCGGCGGNACCGGCGGCNNNGGCGGCNNCGGCGGC	37	0	0	NA	NA	NA	12	12	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA,NA|280aa|down_2|NZ_CP007803.1_3926700_3927540_+	NA|255aa|up_9|NZ_CP007803.1_3895737_3896502_-	COG0767, Ttg2B, ABC-type transport system involved in resistance to organic solvents, permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|318aa|up_8|NZ_CP007803.1_3896727_3897681_-	PRK07792, fabG, 3-ketoacyl-(acyl-carrier-protein) reductase; Provisional	NA|64aa|up_7|NZ_CP007803.1_3897705_3897897_-	COG1141, Fer, Ferredoxin [Energy production and conversion]	NA|401aa|up_6|NZ_CP007803.1_3898111_3899314_+	cd01152, ACAD_fadE6_17_26, Putative acyl-CoA dehydrogenases similar to fadE6, fadE17, and fadE26	NA|374aa|up_5|NZ_CP007803.1_3899338_3900460_+	COG1960, CaiA, Acyl-CoA dehydrogenases [Lipid metabolism]	NA|503aa|up_4|NZ_CP007803.1_3900530_3902039_+	PRK07867, PRK07867, acyl-CoA synthetase; Validated	NA|1424aa|up_3|NZ_CP007803.1_3902209_3906481_+	pfam00934, PE, PE family	NA|516aa|up_2|NZ_CP007803.1_3911906_3913454_-	PRK07586, PRK07586, acetolactate synthase large subunit	NA|279aa|up_1|NZ_CP007803.1_3913450_3914287_-	COG2159, COG2159, Predicted metal-dependent hydrolase of the TIM-barrel fold [General function prediction only]	NA|219aa|up_0|NZ_CP007803.1_3919910_3920567_-	PRK07798, PRK07798, acyl-CoA synthetase; Validated	NA|549aa|down_0|NZ_CP007803.1_3924093_3925740_-	PRK07798, PRK07798, acyl-CoA synthetase; Validated	NA|264aa|down_1|NZ_CP007803.1_3925813_3926605_+	PRK07799, PRK07799, crotonase/enoyl-CoA hydratase family protein	NA|280aa|down_2|NZ_CP007803.1_3926700_3927540_+	NA	NA|399aa|down_3|NZ_CP007803.1_3927594_3928791_-	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|237aa|down_4|NZ_CP007803.1_3928819_3929530_+	pfam06314, ADC, Acetoacetate decarboxylase (ADC)	NA|348aa|down_5|NZ_CP007803.1_3929594_3930638_-	TIGR03559, F420_Rv3520c, probable F420-dependent oxidoreductase, Rv3520c family	NA|304aa|down_6|NZ_CP007803.1_3930790_3931702_+	COG1545, COG1545, Predicted nucleic-acid-binding protein containing a Zn-ribbon [General function prediction only]	NA|355aa|down_7|NZ_CP007803.1_3931717_3932782_+	PRK07937, PRK07937, lipid-transfer protein; Provisional	NA|395aa|down_8|NZ_CP007803.1_3932798_3933983_+	PRK08313, PRK08313, thiolase domain-containing protein	NA|344aa|down_9|NZ_CP007803.1_3934024_3935056_+	cd14952, NHL_PKND_like, NHL repeat domain of the protein kinase PknD
GCF_000698475.1_ASM69847v1	NZ_CP007803	Mycobacterium tuberculosis K chromosome, complete genome	10	4084264-4084352	8	CRISPRCasFinder	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Orphan	GCTCGGCGACGATGCGGGCCGGATGACGGCC	31	0	0	NA	NA	NA	1	1	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA|257aa|up_6|NZ_CP007803.1_4074850_4075621_-,NA|233aa|up_0|NZ_CP007803.1_4083368_4084067_-,NA|126aa|down_6|NZ_CP007803.1_4089587_4089965_+	NA|388aa|up_9|NZ_CP007803.1_4070521_4071685_-	TIGR03819, heli_sec_ATPase, helicase/secretion neighborhood ATPase	NA|351aa|up_8|NZ_CP007803.1_4071681_4072734_-	TIGR03815, CpaE_hom_Actino, helicase/secretion neighborhood CpaE-like protein	NA|288aa|up_7|NZ_CP007803.1_4073232_4074096_+	TIGR01490, Uncharacterized_protein_Rv3661/MT3761, HAD-superfamily subfamily IB hydrolase, TIGR01490	NA|257aa|up_6|NZ_CP007803.1_4074850_4075621_-	NA	NA|549aa|up_5|NZ_CP007803.1_4075617_4077264_-	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]	NA|288aa|up_4|NZ_CP007803.1_4077260_4078124_-	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|309aa|up_3|NZ_CP007803.1_4078116_4079043_-	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|542aa|up_2|NZ_CP007803.1_4079044_4080670_-	cd00995, PBP2_NikA_DppA_OppA_like, The substrate-binding domain of an ABC-type nickel/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|652aa|up_1|NZ_CP007803.1_4081377_4083333_+	PRK00174, PRK00174, acetyl-CoA synthetase; Provisional	NA|233aa|up_0|NZ_CP007803.1_4083368_4084067_-	NA	NA|173aa|down_0|NZ_CP007803.1_4084412_4084931_+	pfam07332, Phage_holin_3_6, Putative Actinobacterial Holin-X, holin superfamily III	NA|328aa|down_1|NZ_CP007803.1_4084931_4085915_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|398aa|down_2|NZ_CP007803.1_4085907_4087101_-	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|274aa|down_3|NZ_CP007803.1_4087106_4087928_-	cd03426, CoAse, Coenzyme A pyrophosphatase (CoAse), a member of the Nudix hydrolase superfamily, functions to catalyze the elimination of oxidized inactive CoA, which can inhibit CoA-utilizing enzymes	NA|228aa|down_4|NZ_CP007803.1_4088059_4088743_-	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|246aa|down_5|NZ_CP007803.1_4088742_4089480_-	COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]	NA|126aa|down_6|NZ_CP007803.1_4089587_4089965_+	NA	NA|225aa|down_7|NZ_CP007803.1_4090063_4090738_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|265aa|down_8|NZ_CP007803.1_4090843_4091638_-	cd16278, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	NA|152aa|down_9|NZ_CP007803.1_4091644_4092100_-	cd02199, YjgF_YER057c_UK114_like_1, This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function
