assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000572125.1_ASM57212v1	NZ_CP002871	Mycobacterium tuberculosis HKBS1 chromosome, complete genome	1	364865-365563	1	CRISPRCasFinder	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Orphan	TTCGCGAAGCCGATGTTGTAGCTGCCGGTGTTG	33	2	2	365153-365194|365228-365260	NZ_CP002871.1_373169-373210|NZ_CP002871.1_371984-372016	NA	10	10	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA|101aa|up_4|NZ_CP002871.1_361877_362180_+,NA|161aa|down_1|NZ_CP002871.1_374958_375441_-,NA|410aa|down_5|NZ_CP002871.1_377557_378787_+	NA|262aa|up_9|NZ_CP002871.1_356572_357358_+	PRK14103, PRK14103, trans-aconitate 2-methyltransferase; Provisional	NA|268aa|up_8|NZ_CP002871.1_357346_358150_-	COG4424, COG4424, Uncharacterized protein conserved in bacteria [Function unknown]	NA|466aa|up_7|NZ_CP002871.1_358159_359557_-	cd16027, SGSH, N-sulfoglucosamine sulfohydrolase (SGSH; sulfamidase)	NA|592aa|up_6|NZ_CP002871.1_359735_361511_+	pfam00934, PE, PE family	NA|76aa|up_5|NZ_CP002871.1_361653_361881_+	pfam01402, RHH_1, Ribbon-helix-helix protein, copG family	NA|101aa|up_4|NZ_CP002871.1_361877_362180_+	NA	NA|74aa|up_3|NZ_CP002871.1_362227_362449_+	PHA01748, PHA01748, hypothetical protein	NA|142aa|up_2|NZ_CP002871.1_362445_362871_+	cd18755, PIN_MtVapC3_VapC21-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC3, VapC21 and related proteins	NA|211aa|up_1|NZ_CP002871.1_363006_363639_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|303aa|up_0|NZ_CP002871.1_363635_364544_+	cd09810, LPOR_like_SDR_c_like, light-dependent protochlorophyllide reductase (LPOR)-like, classical (c)-like SDRs	NA|224aa|down_0|NZ_CP002871.1_374299_374971_+	TIGR02476, BluB, 5,6-dimethylbenzimidazole synthase	NA|161aa|down_1|NZ_CP002871.1_374958_375441_-	NA	NA|239aa|down_2|NZ_CP002871.1_375498_376215_+	cd03392, PAP2_like_2, PAP2_like_2 proteins	NA|219aa|down_3|NZ_CP002871.1_376316_376973_+	COG3786, COG3786, Uncharacterized protein conserved in bacteria [Function unknown]	NA|164aa|down_4|NZ_CP002871.1_377042_377534_-	pfam13577, SnoaL_4, SnoaL-like domain	NA|410aa|down_5|NZ_CP002871.1_377557_378787_+	NA	NA|621aa|down_6|NZ_CP002871.1_378941_380804_+	COG0443, DnaK, Molecular chaperone [Posttranslational modification, protein turnover, chaperones]	NA|129aa|down_7|NZ_CP002871.1_380875_381262_+	COG0326, HtpG, Molecular chaperone, HSP90 family [Posttranslational modification, protein turnover, chaperones]	NA|212aa|down_8|NZ_CP002871.1_381264_381900_-	pfam11259, DUF3060, Protein of unknown function (DUF3060)	NA|295aa|down_9|NZ_CP002871.1_381987_382872_+	COG2273, SKN1, Beta-glucanase/Beta-glucan synthetase [Carbohydrate transport and metabolism]
GCF_000572125.1_ASM57212v1	NZ_CP002871	Mycobacterium tuberculosis HKBS1 chromosome, complete genome	2	690398-690474	2	CRISPRCasFinder	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Orphan	TGAGGTGCGGCGTGAGCGCGGGT	23	0	0	NA	NA	NA	1	1	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA,NA|58aa|down_4|NZ_CP002871.1_695392_695566_+	NA|136aa|up_9|NZ_CP002871.1_676293_676701_+	cd18696, PIN_MtVapC26-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC26 and related proteins	NA|229aa|up_8|NZ_CP002871.1_676760_677447_-	pfam10738, Lpp-LpqN, Probable lipoprotein LpqN	NA|878aa|up_7|NZ_CP002871.1_677600_680234_+	COG3537, COG3537, Putative alpha-1,2-mannosidase [Carbohydrate transport and metabolism]	NA|796aa|up_6|NZ_CP002871.1_680256_682644_-	pfam03706, LPG_synthase_TM, Lysylphosphatidylglycerol synthase TM region	NA|241aa|up_5|NZ_CP002871.1_682781_683504_+	COG2186, FadR, Transcriptional regulators [Transcription]	NA|266aa|up_4|NZ_CP002871.1_683500_684298_+	COG0767, Ttg2B, ABC-type transport system involved in resistance to organic solvents, permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|296aa|up_3|NZ_CP002871.1_684299_685187_+	COG0767, Ttg2B, ABC-type transport system involved in resistance to organic solvents, permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|405aa|up_2|NZ_CP002871.1_685192_686407_+	pfam11887, Mce4_CUP1, Cholesterol uptake porter CUP1 of Mce4, putative	NA|344aa|up_1|NZ_CP002871.1_686403_687435_+	COG1463, Ttg2C, ABC-type transport system involved in resistance to organic solvents, periplasmic component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|482aa|up_0|NZ_CP002871.1_687431_688877_+	TIGR00996, Mtu_fam_mce, virulence factor Mce family protein	NA|517aa|down_0|NZ_CP002871.1_691609_693160_+	COG1463, Ttg2C, ABC-type transport system involved in resistance to organic solvents, periplasmic component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|131aa|down_1|NZ_CP002871.1_693211_693604_-	cd18768, PIN_MtVapC4-C5-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC4, VapC5, and related proteins	NA|86aa|down_2|NZ_CP002871.1_693600_693858_-	COG4118, Phd, Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]	NA|412aa|down_3|NZ_CP002871.1_694040_695276_-	COG1373, COG1373, Predicted ATPase (AAA+ superfamily) [General function prediction only]	NA|58aa|down_4|NZ_CP002871.1_695392_695566_+	NA	NA|138aa|down_5|NZ_CP002871.1_695526_695940_-	cd18681, PIN_MtVapC27-VapC40_like, VapC-like PIN domain of Mycobacterium tuberculosis VapC27, and VapC40, and related proteins	NA|79aa|down_6|NZ_CP002871.1_695936_696173_-	COG2002, AbrB, Regulators of stationary/sporulation gene expression [Transcription]	NA|169aa|down_7|NZ_CP002871.1_696276_696783_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|177aa|down_8|NZ_CP002871.1_696896_697427_-	PRK10755, PRK10755, two-component system sensor histidine kinase PmrB	NA|254aa|down_9|NZ_CP002871.1_697410_698172_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]
GCF_000572125.1_ASM57212v1	NZ_CP002871	Mycobacterium tuberculosis HKBS1 chromosome, complete genome	3	1572341-1573413	3	CRISPRCasFinder	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Orphan	CCGCCGTTGCCGCCGGCACCGCCGTCGCCGAT	32	1	1	1572541-1572562	NZ_CP002871.1_1210758-1210737	NA	15	15	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA|106aa|up_7|NZ_CP002871.1_1564104_1564422_+,NA	NA|103aa|up_9|NZ_CP002871.1_1561619_1561928_+	pfam00934, PE, PE family	NA|540aa|up_8|NZ_CP002871.1_1561924_1563544_+	COG5651, COG5651, PPE-repeat proteins [Cell motility and secretion]	NA|106aa|up_7|NZ_CP002871.1_1564104_1564422_+	NA	NA|209aa|up_6|NZ_CP002871.1_1564556_1565183_+	PRK00300, gmk, guanylate kinase; Provisional	NA|111aa|up_5|NZ_CP002871.1_1565248_1565581_+	TIGR00690, DNA-directed_RNA_polymerase_subunit_omega, DNA-directed RNA polymerase, omega subunit	NA|419aa|up_4|NZ_CP002871.1_1565596_1566853_+	PRK05579, PRK05579, bifunctional phosphopantothenoylcysteine decarboxylase/phosphopantothenate synthase; Validated	NA|404aa|up_3|NZ_CP002871.1_1566980_1568192_+	PRK05250, PRK05250, S-adenosylmethionine synthetase; Validated	NA|493aa|up_2|NZ_CP002871.1_1568264_1569743_-	COG2072, TrkA, Predicted flavoprotein involved in K+ transport [Inorganic ion transport and metabolism]	NA|462aa|up_1|NZ_CP002871.1_1569739_1571125_-	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|345aa|up_0|NZ_CP002871.1_1571202_1572237_+	smart00342, HTH_ARAC, helix_turn_helix, arabinose operon control protein	NA|134aa|down_0|NZ_CP002871.1_1574267_1574669_-	cd18741, PIN_VapC4-5_FitB-like, uncharacterized subgroup of the PIN_VapC4-5_FitB-like subfamily of the PIN domain superfamily	NA|86aa|down_1|NZ_CP002871.1_1574665_1574923_-	pfam01402, RHH_1, Ribbon-helix-helix protein, copG family	NA|320aa|down_2|NZ_CP002871.1_1575005_1575965_-	pfam07859, Abhydrolase_3, alpha/beta hydrolase fold	NA|321aa|down_3|NZ_CP002871.1_1575989_1576952_-	pfam07859, Abhydrolase_3, alpha/beta hydrolase fold	NA|246aa|down_4|NZ_CP002871.1_1576950_1577688_+	COG3714, COG3714, Predicted membrane protein [Function unknown]	NA|656aa|down_5|NZ_CP002871.1_1577768_1579736_+	PRK14873, PRK14873, primosomal protein N'	NA|275aa|down_6|NZ_CP002871.1_1579753_1580578_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|161aa|down_7|NZ_CP002871.1_1580746_1581229_+	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|275aa|down_8|NZ_CP002871.1_1581300_1582125_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|313aa|down_9|NZ_CP002871.1_1582321_1583260_+	PRK00005, fmt, methionyl-tRNA formyltransferase; Reviewed
GCF_000572125.1_ASM57212v1	NZ_CP002871	Mycobacterium tuberculosis HKBS1 chromosome, complete genome	4	2070813-2071067	1	CRT	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Orphan	GCCNCCGTCGCCGCCNNTGCC	21	2	3	2070894-2070911|2070894-2070911|2070984-2071001	NZ_CP002871.1_399465-399448|NZ_CP002871.1_605732-605715|NZ_CP002871.1_3438254-3438237	NA	5	5	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA|88aa|up_0|NZ_CP002871.1_2070172_2070436_-,NA	NA|165aa|up_9|NZ_CP002871.1_2056522_2057017_+	pfam02577, DNase-RNase, Bifunctional nuclease	NA|211aa|up_8|NZ_CP002871.1_2057409_2058042_+	cd01105, HTH_GlnR-like, Helix-Turn-Helix DNA binding domain of GlnR-like transcription regulators	NA|942aa|up_7|NZ_CP002871.1_2058400_2061226_+	PRK05367, PRK05367, aminomethyl-transferring glycine dehydrogenase	NA|287aa|up_6|NZ_CP002871.1_2061452_2062313_-	PRK03204, PRK03204, haloalkane dehalogenase; Provisional	NA|289aa|up_5|NZ_CP002871.1_2062353_2063220_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|629aa|up_4|NZ_CP002871.1_2063224_2065111_-	TIGR00976, Hypothetical_protein_Rv1835c/MT1883/Mb1866c	NA|678aa|up_3|NZ_CP002871.1_2065126_2067160_-	cd01456, vWA_ywmD_type, VWA ywmD type:Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)	NA|742aa|up_2|NZ_CP002871.1_2067279_2069505_-	PRK02999, PRK02999, malate synthase G; Provisional	NA|132aa|up_1|NZ_CP002871.1_2069780_2070176_-	COG1848, COG1848, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|88aa|up_0|NZ_CP002871.1_2070172_2070436_-	NA	NA|346aa|down_0|NZ_CP002871.1_2072204_2073242_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|456aa|down_1|NZ_CP002871.1_2073241_2074609_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|480aa|down_2|NZ_CP002871.1_2074782_2076222_-	PRK07807, PRK07807, GuaB1 family IMP dehydrogenase-related protein	NA|484aa|down_3|NZ_CP002871.1_2076254_2077706_-	PRK09287, PRK09287, NADP-dependent phosphogluconate dehydrogenase	NA|317aa|down_4|NZ_CP002871.1_2077735_2078686_-	cd07326, M56_BlaR1_MecR1_like, Peptidase M56-like including those in BlaR1 and MecR1, integral membrane metallopeptidase	NA|139aa|down_5|NZ_CP002871.1_2078700_2079117_-	COG3682, COG3682, Predicted transcriptional regulator [Transcription]	NA|141aa|down_6|NZ_CP002871.1_2079394_2079817_+	cd03443, PaaI_thioesterase, PaaI_thioesterase is a tetrameric acyl-CoA thioesterase with a hot dog fold and one of several proteins responsible for phenylacetic acid (PA) degradation in bacteria	NA|101aa|down_7|NZ_CP002871.1_2079865_2080168_+	pfam00547, Urease_gamma, Urease, gamma subunit	NA|105aa|down_8|NZ_CP002871.1_2080164_2080479_+	PRK13202, ureB, urease subunit beta; Reviewed	NA|578aa|down_9|NZ_CP002871.1_2080478_2082212_+	PRK13206, ureC, urease subunit alpha; Reviewed
GCF_000572125.1_ASM57212v1	NZ_CP002871	Mycobacterium tuberculosis HKBS1 chromosome, complete genome	5	2153960-2154179	4	CRISPRCasFinder	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Orphan	GTGCCAGCCGGAATCGTGATCGGCGGAACCGTCACCGACGGAATACTCA	49	0	0	NA	NA	NA	1	1	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA|136aa|up_2|NZ_CP002871.1_2142495_2142903_-,NA|127aa|down_5|NZ_CP002871.1_2161695_2162076_-	NA|155aa|up_9|NZ_CP002871.1_2135823_2136288_-	pfam14081, DUF4262, Domain of unknown function (DUF4262)	NA|741aa|up_8|NZ_CP002871.1_2136463_2138686_-	PRK15061, PRK15061, catalase/peroxidase	NA|148aa|up_7|NZ_CP002871.1_2138723_2139167_-	COG0735, Fur, Fe2+/Zn2+ uptake regulation proteins [Inorganic ion transport and metabolism]	NA|198aa|up_6|NZ_CP002871.1_2139280_2139874_-	COG1881, COG1881, Phospholipid-binding protein [General function prediction only]	NA|202aa|up_5|NZ_CP002871.1_2139956_2140562_-	COG1881, COG1881, Phospholipid-binding protein [General function prediction only]	NA|335aa|up_4|NZ_CP002871.1_2140661_2141666_-	cd08275, MDR3, Medium chain dehydrogenases/reductase (MDR)/zinc-dependent alcohol dehydrogenase-like family	NA|251aa|up_3|NZ_CP002871.1_2141765_2142518_+	cd16282, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	NA|136aa|up_2|NZ_CP002871.1_2142495_2142903_-	NA	NA|767aa|up_1|NZ_CP002871.1_2143037_2145338_+	PLN02892, PLN02892, isocitrate lyase	NA|421aa|up_0|NZ_CP002871.1_2149536_2150798_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|155aa|down_0|NZ_CP002871.1_2156206_2156671_-	cd07821, PYR_PYL_RCAR_like, Pyrabactin resistance 1 (PYR1), PYR1-like (PYL), regulatory component of abscisic acid receptors (RCARs), and related proteins	NA|288aa|down_1|NZ_CP002871.1_2156768_2157632_+	cd07987, LPLAT_MGAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: MGAT-like	NA|424aa|down_2|NZ_CP002871.1_2157669_2158941_-	TIGR01490, Uncharacterized_protein_Rv3661/MT3761, HAD-superfamily subfamily IB hydrolase, TIGR01490	NA|372aa|down_3|NZ_CP002871.1_2159212_2160328_+	COG1680, AmpC, Beta-lactamase class C and other penicillin binding proteins [Defense mechanisms]	NA|447aa|down_4|NZ_CP002871.1_2160318_2161659_+	pfam00144, Beta-lactamase, Beta-lactamase	NA|127aa|down_5|NZ_CP002871.1_2161695_2162076_-	NA	NA|621aa|down_6|NZ_CP002871.1_2162232_2164095_+	PRK12476, PRK12476, putative fatty-acid--CoA ligase; Provisional	NA|160aa|down_7|NZ_CP002871.1_2164102_2164582_-	pfam09167, DUF1942, Domain of unknown function (DUF1942)	NA|258aa|down_8|NZ_CP002871.1_2164818_2165592_+	COG3361, COG3361, Uncharacterized conserved protein [Function unknown]	NA|256aa|down_9|NZ_CP002871.1_2165595_2166363_-	PRK05867, PRK05867, SDR family oxidoreductase
GCF_000572125.1_ASM57212v1	NZ_CP002871	Mycobacterium tuberculosis HKBS1 chromosome, complete genome	6	3110044-3111191	1,5,2	PILER-CR,CRISPRCasFinder,CRT	no	c2c9_V-U4,csm3gr7,csm2gr11,cas10,cas6	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Type III-D,Type III-A,Type III-C,Type III-B	GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC,GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC,GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC	36,36,36	0	0	NA	NA	II-B,III-A:II-B,III-A:II-B,III-A	14,14,15	15	TypeIII-D,TypeIII-A,TypeIII-C,TypeIII-B	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA|85aa|up_7|NZ_CP002871.1_3103323_3103578_-,NA|135aa|up_6|NZ_CP002871.1_3103725_3104130_+,NA|64aa|up_5|NZ_CP002871.1_3104126_3104318_+,NA|86aa|up_4|NZ_CP002871.1_3105904_3106162_+,NA|104aa|up_3|NZ_CP002871.1_3106266_3106578_+,NA|203aa|up_2|NZ_CP002871.1_3106997_3107606_+,NA	NA|348aa|up_9|NZ_CP002871.1_3101638_3102682_-	COG5586, COG5586, Uncharacterized conserved protein [Function unknown]	NA|92aa|up_8|NZ_CP002871.1_3102872_3103148_+	COG4453, COG4453, Uncharacterized protein conserved in bacteria [Function unknown]	NA|85aa|up_7|NZ_CP002871.1_3103323_3103578_-	NA	NA|135aa|up_6|NZ_CP002871.1_3103725_3104130_+	NA	NA|64aa|up_5|NZ_CP002871.1_3104126_3104318_+	NA	NA|86aa|up_4|NZ_CP002871.1_3105904_3106162_+	NA	NA|104aa|up_3|NZ_CP002871.1_3106266_3106578_+	NA	NA|203aa|up_2|NZ_CP002871.1_3106997_3107606_+	NA	NA|470aa|up_1|NZ_CP002871.1_3107676_3109086_+	pfam00665, rve, Integrase core domain	NA|271aa|up_0|NZ_CP002871.1_3109082_3109895_+	COG3267, ExeA, Type II secretory pathway, component ExeA (predicted ATPase) [Intracellular trafficking and secretion]	NA|421aa|down_0|NZ_CP002871.1_3111222_3112484_+	PHA02517, PHA02517, putative transposase OrfB; Reviewed	csm3gr7|237aa|down_1|NZ_CP002871.1_3112847_3113558_-	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm2gr11|125aa|down_2|NZ_CP002871.1_3113567_3113942_-	TIGR01870, CRISPR_type_III-associated_protein_Csm2, CRISPR type III-A/MTUBE-associated protein Csm2	cas10|813aa|down_3|NZ_CP002871.1_3113938_3116377_-	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	cas6|263aa|down_4|NZ_CP002871.1_3116373_3117162_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|182aa|down_5|NZ_CP002871.1_3117495_3118041_-	COG4293, COG4293, Uncharacterized protein conserved in bacteria [Function unknown]	NA|295aa|down_6|NZ_CP002871.1_3118312_3119197_-	COG2253, COG2253, Uncharacterized conserved protein [Function unknown]	NA|296aa|down_7|NZ_CP002871.1_3119199_3120087_-	pfam09407, AbiEi_1, AbiEi antitoxin C-terminal domain	NA|182aa|down_8|NZ_CP002871.1_3120391_3120937_-	COG4293, COG4293, Uncharacterized protein conserved in bacteria [Function unknown]	NA|90aa|down_9|NZ_CP002871.1_3120933_3121203_-	COG5552, COG5552, Uncharacterized conserved protein [Function unknown]
GCF_000572125.1_ASM57212v1	NZ_CP002871	Mycobacterium tuberculosis HKBS1 chromosome, complete genome	7	3652130-3652248	6	CRISPRCasFinder	no	cas3	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Unclear	GCCCCTGTGAGTCGAGTGAGCGGAACGAAC	30	1	1	3652160-3652218	NZ_CP002871.1_3652223-3652281	NA	1	1	Unclear	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA,NA|104aa|down_6|NZ_CP002871.1_3657881_3658193_-,NA|126aa|down_7|NZ_CP002871.1_3658329_3658707_-	NA|719aa|up_9|NZ_CP002871.1_3638956_3641113_+	COG2217, ZntA, Cation transport ATPase [Inorganic ion transport and metabolism]	NA|223aa|up_8|NZ_CP002871.1_3641109_3641778_-	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|395aa|up_7|NZ_CP002871.1_3641878_3643063_+	pfam02515, CoA_transf_3, CoA-transferase family III	NA|757aa|up_6|NZ_CP002871.1_3643091_3645362_+	COG0659, SUL1, Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism]	NA|390aa|up_5|NZ_CP002871.1_3645350_3646520_-	cd01158, SCAD_SBCAD, Short chain acyl-CoA dehydrogenases and eukaryotic short/branched chain acyl-CoA dehydrogenases	NA|175aa|up_4|NZ_CP002871.1_3646544_3647069_-	PLN02948, PLN02948, phosphoribosylaminoimidazole carboxylase	NA|430aa|up_3|NZ_CP002871.1_3647065_3648355_-	TIGR01161, N5-carboxyaminoimidazole_ribonucleotide_synthase, phosphoribosylaminoimidazole carboxylase, PurK protein	NA|224aa|up_2|NZ_CP002871.1_3648455_3649127_+	COG2246, COG2246, Predicted membrane protein [Function unknown]	NA|173aa|up_1|NZ_CP002871.1_3649081_3649600_-	pfam03703, bPH_2, Bacterial PH domain	NA|267aa|up_0|NZ_CP002871.1_3649642_3650443_-	COG0340, BirA, Biotin-(acetyl-CoA carboxylase) ligase [Coenzyme metabolism]	NA|223aa|down_0|NZ_CP002871.1_3652514_3653183_+	PRK00148, PRK00148, Maf-like protein; Reviewed	NA|298aa|down_1|NZ_CP002871.1_3653223_3654117_+	COG2897, SseA, Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism]	NA|144aa|down_2|NZ_CP002871.1_3654113_3654545_+	COG2166, sufE, Cysteine desulfurase SufE subunit [Posttranslational modification, protein turnover, chaperones]	NA|601aa|down_3|NZ_CP002871.1_3654652_3656455_+	COG4770, COG4770, Acetyl/propionyl-CoA carboxylase, alpha subunit [Lipid metabolism]	NA|262aa|down_4|NZ_CP002871.1_3656464_3657250_-	PRK07122, PRK07122, RNA polymerase sigma factor SigF; Reviewed	NA|146aa|down_5|NZ_CP002871.1_3657246_3657684_-	COG2172, RsbW, Anti-sigma regulatory factor (Ser/Thr protein kinase) [Signal transduction mechanisms]	NA|104aa|down_6|NZ_CP002871.1_3657881_3658193_-	NA	NA|126aa|down_7|NZ_CP002871.1_3658329_3658707_-	NA	NA|450aa|down_8|NZ_CP002871.1_3658740_3660090_-	PRK08297, PRK08297, L-lysine aminotransferase; Provisional	NA|151aa|down_9|NZ_CP002871.1_3660140_3660593_-	smart00344, HTH_ASNC, helix_turn_helix ASNC type
GCF_000572125.1_ASM57212v1	NZ_CP002871	Mycobacterium tuberculosis HKBS1 chromosome, complete genome	8	3732870-3733446	3	CRT	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Orphan	GCCGCCGTTGCCNCCGNNGCCGCCG	25	0	0	NA	NA	NA	7	7	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA,NA|96aa|down_4|NZ_CP002871.1_3747678_3747966_-	NA|147aa|up_9|NZ_CP002871.1_3713410_3713851_+	cd04770, HTH_HMRTR, Helix-Turn-Helix DNA binding domain of Heavy Metal Resistance transcription regulators	NA|290aa|up_8|NZ_CP002871.1_3713884_3714754_-	TIGR00766, Uncharacterized_protein_Dda3937_02003, inner membrane protein YhjD	NA|337aa|up_7|NZ_CP002871.1_3714774_3715785_-	PRK00927, PRK00927, tryptophanyl-tRNA synthetase; Reviewed	NA|215aa|up_6|NZ_CP002871.1_3716057_3716702_+	PRK14875, PRK14875, acetoin dehydrogenase E2 subunit dihydrolipoyllysine-residue acetyltransferase; Provisional	NA|410aa|up_5|NZ_CP002871.1_3716768_3717998_-	PRK08299, PRK08299, NADP-dependent isocitrate dehydrogenase	NA|450aa|up_4|NZ_CP002871.1_3718280_3719630_+	PRK07812, PRK07812, O-acetylhomoserine aminocarboxypropyltransferase; Validated	NA|380aa|up_3|NZ_CP002871.1_3719641_3720781_+	PRK00175, metX, homoserine O-acetyltransferase; Provisional	NA|244aa|up_2|NZ_CP002871.1_3720777_3721509_+	pfam08241, Methyltransf_11, Methyltransferase domain	NA|2538aa|up_1|NZ_CP002871.1_3721517_3729131_-	pfam00823, PPE, PPE family	NA|327aa|up_0|NZ_CP002871.1_3729179_3730160_-	pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex non-fungal	NA|86aa|down_0|NZ_CP002871.1_3735548_3735806_-	pfam11222, DUF3017, Protein of unknown function (DUF3017)	NA|3158aa|down_1|NZ_CP002871.1_3736061_3745535_-	pfam00823, PPE, PPE family	NA|149aa|down_2|NZ_CP002871.1_3746160_3746607_+	pfam01548, DEDD_Tnp_IS110, Transposase	NA|247aa|down_3|NZ_CP002871.1_3746643_3747384_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|96aa|down_4|NZ_CP002871.1_3747678_3747966_-	NA	NA|130aa|down_5|NZ_CP002871.1_3761466_3761856_+	pfam05305, DUF732, Protein of unknown function (DUF732)	NA|98aa|down_6|NZ_CP002871.1_3761869_3762163_-	pfam11222, DUF3017, Protein of unknown function (DUF3017)	NA|282aa|down_7|NZ_CP002871.1_3762159_3763005_-	PRK14193, PRK14193, bifunctional 5,10-methylene-tetrahydrofolate dehydrogenase/ 5,10-methylene-tetrahydrofolate cyclohydrolase; Provisional	NA|92aa|down_8|NZ_CP002871.1_3763128_3763404_+	COG2161, StbD, Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]	NA|86aa|down_9|NZ_CP002871.1_3763400_3763658_+	pfam06769, YoeB_toxin, YoeB-like toxin of bacterial type II toxin-antitoxin system
GCF_000572125.1_ASM57212v1	NZ_CP002871	Mycobacterium tuberculosis HKBS1 chromosome, complete genome	9	3842156-3842245	7	CRISPRCasFinder	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Orphan	CCAGGCGTTGGGCTGGCTGCCGAT	24	0	0	NA	NA	NA	1	1	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA|77aa|up_6|NZ_CP002871.1_3834192_3834423_+,NA|121aa|up_5|NZ_CP002871.1_3834526_3834889_-,NA|121aa|down_0|NZ_CP002871.1_3843230_3843593_-,NA|52aa|down_2|NZ_CP002871.1_3845434_3845590_-,NA|70aa|down_3|NZ_CP002871.1_3845614_3845824_-,NA|238aa|down_6|NZ_CP002871.1_3849805_3850519_-	NA|212aa|up_9|NZ_CP002871.1_3831871_3832507_-	COG1214, COG1214, Inactive homolog of metal-dependent proteases, putative molecular chaperone [Posttranslational modification, protein turnover, chaperones]	NA|169aa|up_8|NZ_CP002871.1_3832503_3833010_-	COG0802, COG0802, Predicted ATPase or kinase [General function prediction only]	NA|387aa|up_7|NZ_CP002871.1_3833006_3834167_-	PRK00053, alr, alanine racemase; Reviewed	NA|77aa|up_6|NZ_CP002871.1_3834192_3834423_+	NA	NA|121aa|up_5|NZ_CP002871.1_3834526_3834889_-	NA	NA|177aa|up_4|NZ_CP002871.1_3835847_3836378_+	pfam00823, PPE, PPE family	NA|252aa|up_3|NZ_CP002871.1_3836695_3837451_-	COG1484, DnaC, DNA replication protein [DNA replication, recombination, and repair]	NA|421aa|up_2|NZ_CP002871.1_3837534_3838796_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|100aa|up_1|NZ_CP002871.1_3840934_3841234_+	pfam00934, PE, PE family	NA|181aa|up_0|NZ_CP002871.1_3841319_3841862_+	COG5651, COG5651, PPE-repeat proteins [Cell motility and secretion]	NA|121aa|down_0|NZ_CP002871.1_3843230_3843593_-	NA	NA|179aa|down_1|NZ_CP002871.1_3843755_3844292_+	pfam00823, PPE, PPE family	NA|52aa|down_2|NZ_CP002871.1_3845434_3845590_-	NA	NA|70aa|down_3|NZ_CP002871.1_3845614_3845824_-	NA	NA|461aa|down_4|NZ_CP002871.1_3846962_3848345_-	TIGR01788, Glutamate_decarboxylase_alpha_GAD-alpha	NA|474aa|down_5|NZ_CP002871.1_3848382_3849804_-	pfam01256, Carb_kinase, Carbohydrate kinase	NA|238aa|down_6|NZ_CP002871.1_3849805_3850519_-	NA	NA|285aa|down_7|NZ_CP002871.1_3850529_3851384_-	pfam14494, DUF4436, Domain of unknown function (DUF4436)	NA|625aa|down_8|NZ_CP002871.1_3851605_3853480_-	PRK00331, PRK00331, isomerizing glutamine--fructose-6-phosphate transaminase	NA|159aa|down_9|NZ_CP002871.1_3853501_3853978_+	pfam10708, DUF2510, Protein of unknown function (DUF2510)
GCF_000572125.1_ASM57212v1	NZ_CP002871	Mycobacterium tuberculosis HKBS1 chromosome, complete genome	10	3941910-3945899	4	CRT	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Orphan	CGGCGGNNNCGGCGGNNNNGGCGGNNCCGGCGG	33	4	4	3941943-3942065|3942257-3942343|3944105-3944143|3944714-3944752	NZ_CP002871.1_3926838-3926960|NZ_CP002871.1_3927216-3927302|NZ_CP002871.1_3928182-3928220|NZ_CP002871.1_3928182-3928220	NA	40	40	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA,NA|280aa|down_2|NZ_CP002871.1_3949196_3950036_+	NA|255aa|up_9|NZ_CP002871.1_3915384_3916149_-	COG0767, Ttg2B, ABC-type transport system involved in resistance to organic solvents, permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|318aa|up_8|NZ_CP002871.1_3916374_3917328_-	PRK07792, fabG, 3-ketoacyl-(acyl-carrier-protein) reductase; Provisional	NA|64aa|up_7|NZ_CP002871.1_3917352_3917544_-	COG1141, Fer, Ferredoxin [Energy production and conversion]	NA|401aa|up_6|NZ_CP002871.1_3917758_3918961_+	cd01152, ACAD_fadE6_17_26, Putative acyl-CoA dehydrogenases similar to fadE6, fadE17, and fadE26	NA|374aa|up_5|NZ_CP002871.1_3918985_3920107_+	COG1960, CaiA, Acyl-CoA dehydrogenases [Lipid metabolism]	NA|503aa|up_4|NZ_CP002871.1_3920177_3921686_+	PRK07867, PRK07867, acyl-CoA synthetase; Validated	NA|1384aa|up_3|NZ_CP002871.1_3921856_3926008_+	pfam00934, PE, PE family	NA|1621aa|up_2|NZ_CP002871.1_3926298_3931161_+	pfam00934, PE, PE family	NA|516aa|up_1|NZ_CP002871.1_3931327_3932875_-	PRK07586, PRK07586, acetolactate synthase large subunit	NA|279aa|up_0|NZ_CP002871.1_3932871_3933708_-	COG2159, COG2159, Predicted metal-dependent hydrolase of the TIM-barrel fold [General function prediction only]	NA|549aa|down_0|NZ_CP002871.1_3946589_3948236_-	PRK07798, PRK07798, acyl-CoA synthetase; Validated	NA|264aa|down_1|NZ_CP002871.1_3948309_3949101_+	PRK07799, PRK07799, crotonase/enoyl-CoA hydratase family protein	NA|280aa|down_2|NZ_CP002871.1_3949196_3950036_+	NA	NA|399aa|down_3|NZ_CP002871.1_3950090_3951287_-	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|237aa|down_4|NZ_CP002871.1_3951315_3952026_+	pfam06314, ADC, Acetoacetate decarboxylase (ADC)	NA|344aa|down_5|NZ_CP002871.1_3952090_3953122_-	TIGR03559, F420_Rv3520c, probable F420-dependent oxidoreductase, Rv3520c family	NA|335aa|down_6|NZ_CP002871.1_3953193_3954198_+	COG1545, COG1545, Predicted nucleic-acid-binding protein containing a Zn-ribbon [General function prediction only]	NA|355aa|down_7|NZ_CP002871.1_3954213_3955278_+	PRK07937, PRK07937, lipid-transfer protein; Provisional	NA|395aa|down_8|NZ_CP002871.1_3955294_3956479_+	PRK08313, PRK08313, thiolase domain-containing protein	NA|344aa|down_9|NZ_CP002871.1_3956520_3957552_+	cd14952, NHL_PKND_like, NHL repeat domain of the protein kinase PknD
GCF_000572125.1_ASM57212v1	NZ_CP002871	Mycobacterium tuberculosis HKBS1 chromosome, complete genome	11	4106793-4106881	8	CRISPRCasFinder	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Orphan	GCTCGGCGACGATGCGGGCCGGATGACGGCC	31	0	0	NA	NA	NA	1	1	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA|257aa|up_6|NZ_CP002871.1_4097379_4098150_-,NA|233aa|up_0|NZ_CP002871.1_4105897_4106596_-,NA|126aa|down_6|NZ_CP002871.1_4112116_4112494_+	NA|388aa|up_9|NZ_CP002871.1_4093050_4094214_-	TIGR03819, heli_sec_ATPase, helicase/secretion neighborhood ATPase	NA|351aa|up_8|NZ_CP002871.1_4094210_4095263_-	TIGR03815, CpaE_hom_Actino, helicase/secretion neighborhood CpaE-like protein	NA|288aa|up_7|NZ_CP002871.1_4095761_4096625_+	TIGR01490, Uncharacterized_protein_Rv3661/MT3761, HAD-superfamily subfamily IB hydrolase, TIGR01490	NA|257aa|up_6|NZ_CP002871.1_4097379_4098150_-	NA	NA|549aa|up_5|NZ_CP002871.1_4098146_4099793_-	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]	NA|288aa|up_4|NZ_CP002871.1_4099789_4100653_-	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|309aa|up_3|NZ_CP002871.1_4100645_4101572_-	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|542aa|up_2|NZ_CP002871.1_4101573_4103199_-	cd00995, PBP2_NikA_DppA_OppA_like, The substrate-binding domain of an ABC-type nickel/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|652aa|up_1|NZ_CP002871.1_4103906_4105862_+	PRK00174, PRK00174, acetyl-CoA synthetase; Provisional	NA|233aa|up_0|NZ_CP002871.1_4105897_4106596_-	NA	NA|173aa|down_0|NZ_CP002871.1_4106941_4107460_+	pfam07332, Phage_holin_3_6, Putative Actinobacterial Holin-X, holin superfamily III	NA|328aa|down_1|NZ_CP002871.1_4107460_4108444_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|398aa|down_2|NZ_CP002871.1_4108436_4109630_-	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|274aa|down_3|NZ_CP002871.1_4109635_4110457_-	cd03426, CoAse, Coenzyme A pyrophosphatase (CoAse), a member of the Nudix hydrolase superfamily, functions to catalyze the elimination of oxidized inactive CoA, which can inhibit CoA-utilizing enzymes	NA|216aa|down_4|NZ_CP002871.1_4110588_4111236_-	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|246aa|down_5|NZ_CP002871.1_4111271_4112009_-	COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]	NA|126aa|down_6|NZ_CP002871.1_4112116_4112494_+	NA	NA|225aa|down_7|NZ_CP002871.1_4112592_4113267_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|265aa|down_8|NZ_CP002871.1_4113372_4114167_-	cd16278, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	NA|152aa|down_9|NZ_CP002871.1_4114173_4114629_-	cd02199, YjgF_YER057c_UK114_like_1, This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function
