assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000422125.1_ASM42212v1	NC_021740	Mycobacterium tuberculosis EAI5, complete sequence	1	334509-334696	1	PILER-CR	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	CCGTTGCCGAACAACCACCCGCCGGCCCCGCCGGCAGCCCCGGT	44	0	0	NA	NA	NA	2	2	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|61aa|up_0|NC_021740.1_333168_333351_-,NA	NA|398aa|up_9|NC_021740.1_323346_324540_-	COG3285, COG3285, Predicted eukaryotic-type DNA primase [DNA replication, recombination, and repair]	NA|561aa|up_8|NC_021740.1_324575_326258_+	PRK07788, PRK07788, acyl-CoA synthetase; Validated	NA|732aa|up_7|NC_021740.1_326274_328470_-	cd01152, ACAD_fadE6_17_26, Putative acyl-CoA dehydrogenases similar to fadE6, fadE17, and fadE26	NA|378aa|up_6|NC_021740.1_328583_329717_-	pfam12146, Hydrolase_4, Serine aminopeptidase, S33	NA|207aa|up_5|NC_021740.1_329713_330334_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|194aa|up_4|NC_021740.1_330430_331012_+	pfam00903, Glyoxalase, Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily	NA|242aa|up_3|NC_021740.1_330941_331667_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|307aa|up_2|NC_021740.1_331756_332677_+	COG3662, COG3662, Uncharacterized protein conserved in bacteria [Function unknown]	NA|143aa|up_1|NC_021740.1_332716_333145_-	cd18678, PIN_MtVapC25_VapC33-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC25, VapC33, and related proteins	NA|61aa|up_0|NC_021740.1_333168_333351_-	NA	NA|838aa|down_0|NC_021740.1_336568_339082_-	pfam00934, PE, PE family	NA|537aa|down_1|NC_021740.1_339372_340983_+	COG5651, COG5651, PPE-repeat proteins [Cell motility and secretion]	NA|303aa|down_2|NC_021740.1_341006_341915_+	COG3315, COG3315, O-Methyltransferase involved in polyketide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism]	NA|632aa|down_3|NC_021740.1_342138_344034_+	TIGR03922, T7SS_EccA, type VII secretion AAA-ATPase EccA	NA|1331aa|down_4|NC_021740.1_345643_349636_+	TIGR03924, T7SS_EccC_a, type VII secretion protein EccCa	NA|103aa|down_5|NC_021740.1_349632_349941_+	pfam00934, PE, PE family	NA|514aa|down_6|NC_021740.1_349943_351485_+	COG5651, COG5651, PPE-repeat proteins [Cell motility and secretion]	NA|98aa|down_7|NC_021740.1_351533_351827_+	TIGR03930, WXG100_ESAT6, WXG100 family type VII secretion target	NA|97aa|down_8|NC_021740.1_351856_352147_+	COG4842, COG4842, Uncharacterized protein conserved in bacteria [Function unknown]	NA|296aa|down_9|NC_021740.1_352157_353045_+	pfam14011, ESX-1_EspG, EspG family
GCF_000422125.1_ASM42212v1	NC_021740	Mycobacterium tuberculosis EAI5, complete sequence	2	335394-335966	1	CRT	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	CCGCCGGCNCCGCCGNNGCCG	21	3	3	335814-335837|335859-335891|335913-335945	NC_021740.1_338595-338618|NC_021740.1_338622-338654|NC_021740.1_338676-338708	NA	10	10	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|61aa|up_0|NC_021740.1_333168_333351_-,NA	NA|398aa|up_9|NC_021740.1_323346_324540_-	COG3285, COG3285, Predicted eukaryotic-type DNA primase [DNA replication, recombination, and repair]	NA|561aa|up_8|NC_021740.1_324575_326258_+	PRK07788, PRK07788, acyl-CoA synthetase; Validated	NA|732aa|up_7|NC_021740.1_326274_328470_-	cd01152, ACAD_fadE6_17_26, Putative acyl-CoA dehydrogenases similar to fadE6, fadE17, and fadE26	NA|378aa|up_6|NC_021740.1_328583_329717_-	pfam12146, Hydrolase_4, Serine aminopeptidase, S33	NA|207aa|up_5|NC_021740.1_329713_330334_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|194aa|up_4|NC_021740.1_330430_331012_+	pfam00903, Glyoxalase, Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily	NA|242aa|up_3|NC_021740.1_330941_331667_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|307aa|up_2|NC_021740.1_331756_332677_+	COG3662, COG3662, Uncharacterized protein conserved in bacteria [Function unknown]	NA|143aa|up_1|NC_021740.1_332716_333145_-	cd18678, PIN_MtVapC25_VapC33-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC25, VapC33, and related proteins	NA|61aa|up_0|NC_021740.1_333168_333351_-	NA	NA|838aa|down_0|NC_021740.1_336568_339082_-	pfam00934, PE, PE family	NA|537aa|down_1|NC_021740.1_339372_340983_+	COG5651, COG5651, PPE-repeat proteins [Cell motility and secretion]	NA|303aa|down_2|NC_021740.1_341006_341915_+	COG3315, COG3315, O-Methyltransferase involved in polyketide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism]	NA|632aa|down_3|NC_021740.1_342138_344034_+	TIGR03922, T7SS_EccA, type VII secretion AAA-ATPase EccA	NA|1331aa|down_4|NC_021740.1_345643_349636_+	TIGR03924, T7SS_EccC_a, type VII secretion protein EccCa	NA|103aa|down_5|NC_021740.1_349632_349941_+	pfam00934, PE, PE family	NA|514aa|down_6|NC_021740.1_349943_351485_+	COG5651, COG5651, PPE-repeat proteins [Cell motility and secretion]	NA|98aa|down_7|NC_021740.1_351533_351827_+	TIGR03930, WXG100_ESAT6, WXG100 family type VII secretion target	NA|97aa|down_8|NC_021740.1_351856_352147_+	COG4842, COG4842, Uncharacterized protein conserved in bacteria [Function unknown]	NA|296aa|down_9|NC_021740.1_352157_353045_+	pfam14011, ESX-1_EspG, EspG family
GCF_000422125.1_ASM42212v1	NC_021740	Mycobacterium tuberculosis EAI5, complete sequence	3	366472-367170	1	CRISPRCasFinder	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	TTCGCGAAGCCGATGTTGTAGCTGCCGGTGTTG	33	2	2	366760-366801|366835-366867	NC_021740.1_374791-374832|NC_021740.1_373606-373638	NA	10	10	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|101aa|up_4|NC_021740.1_363484_363787_+,NA|161aa|down_1|NC_021740.1_376580_377063_-,NA|410aa|down_5|NC_021740.1_379179_380409_+	NA|262aa|up_9|NC_021740.1_358179_358965_+	PRK14103, PRK14103, trans-aconitate 2-methyltransferase; Provisional	NA|268aa|up_8|NC_021740.1_358953_359757_-	COG4424, COG4424, Uncharacterized protein conserved in bacteria [Function unknown]	NA|466aa|up_7|NC_021740.1_359766_361164_-	cd16027, SGSH, N-sulfoglucosamine sulfohydrolase (SGSH; sulfamidase)	NA|592aa|up_6|NC_021740.1_361342_363118_+	pfam00934, PE, PE family	NA|76aa|up_5|NC_021740.1_363260_363488_+	pfam01402, RHH_1, Ribbon-helix-helix protein, copG family	NA|101aa|up_4|NC_021740.1_363484_363787_+	NA	NA|74aa|up_3|NC_021740.1_363834_364056_+	PHA01748, PHA01748, hypothetical protein	NA|142aa|up_2|NC_021740.1_364052_364478_+	cd18755, PIN_MtVapC3_VapC21-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC3, VapC21 and related proteins	NA|211aa|up_1|NC_021740.1_364613_365246_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|303aa|up_0|NC_021740.1_365242_366151_+	cd09810, LPOR_like_SDR_c_like, light-dependent protochlorophyllide reductase (LPOR)-like, classical (c)-like SDRs	NA|224aa|down_0|NC_021740.1_375921_376593_+	TIGR02476, BluB, 5,6-dimethylbenzimidazole synthase	NA|161aa|down_1|NC_021740.1_376580_377063_-	NA	NA|239aa|down_2|NC_021740.1_377120_377837_+	cd03392, PAP2_like_2, PAP2_like_2 proteins	NA|219aa|down_3|NC_021740.1_377938_378595_+	COG3786, COG3786, Uncharacterized protein conserved in bacteria [Function unknown]	NA|164aa|down_4|NC_021740.1_378664_379156_-	pfam13577, SnoaL_4, SnoaL-like domain	NA|410aa|down_5|NC_021740.1_379179_380409_+	NA	NA|621aa|down_6|NC_021740.1_380563_382426_+	COG0443, DnaK, Molecular chaperone [Posttranslational modification, protein turnover, chaperones]	NA|129aa|down_7|NC_021740.1_382497_382884_+	COG0326, HtpG, Molecular chaperone, HSP90 family [Posttranslational modification, protein turnover, chaperones]	NA|221aa|down_8|NC_021740.1_382886_383549_-	pfam11259, DUF3060, Protein of unknown function (DUF3060)	NA|295aa|down_9|NC_021740.1_383609_384494_+	COG2273, SKN1, Beta-glucanase/Beta-glucan synthetase [Carbohydrate transport and metabolism]
GCF_000422125.1_ASM42212v1	NC_021740	Mycobacterium tuberculosis EAI5, complete sequence	4	631290-631427	2	CRT	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	CCGCCGACCACGCCGGTG	18	2	3	631308-631325|631308-631325|631392-631409	NC_021740.1_22466-22483|NC_021740.1_1410721-1410738|NC_021740.1_971891-971908	NA	3	3	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|54aa|up_7|NC_021740.1_622128_622290_-,NA|478aa|up_0|NC_021740.1_628305_629739_-,NA|450aa|down_2|NC_021740.1_633062_634412_-,NA|93aa|down_5|NC_021740.1_635942_636221_-	NA|325aa|up_9|NC_021740.1_619898_620873_+	TIGR03144, cytochrome_c_biogenesis_protein_chloroplast, cytochrome c-type biogenesis protein CcsB	NA|406aa|up_8|NC_021740.1_620914_622132_+	COG0455, flhG, Antiactivator of flagellar biosynthesis FleN, an ATPase [Cell motility]	NA|54aa|up_7|NC_021740.1_622128_622290_-	NA	NA|106aa|up_6|NC_021740.1_622336_622654_+	pfam14012, DUF4229, Protein of unknown function (DUF4229)	NA|595aa|up_5|NC_021740.1_622800_624585_+	pfam00934, PE, PE family	NA|336aa|up_4|NC_021740.1_624480_625488_-	TIGR00747, 3-oxoacyl-_synthase_3, 3-oxoacyl-(acyl-carrier-protein) synthase III	NA|293aa|up_3|NC_021740.1_625569_626448_-	TIGR00751, 14-dihydroxy-2-naphthoate_octaprenyltransferase, 1,4-dihydroxy-2-naphthoate octaprenyltransferase	NA|265aa|up_2|NC_021740.1_626464_627259_+	PRK07823, PRK07823, S-methyl-5'-thioadenosine phosphorylase	NA|347aa|up_1|NC_021740.1_627255_628296_+	cd05256, UDP_AE_SDR_e, UDP-N-acetylglucosamine 4-epimerase, extended (e) SDRs	NA|478aa|up_0|NC_021740.1_628305_629739_-	NA	NA|211aa|down_0|NC_021740.1_631750_632383_+	cd04179, DPM_DPG-synthase_like, DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily	NA|221aa|down_1|NC_021740.1_632379_633042_+	COG3222, COG3222, Uncharacterized protein conserved in bacteria [Function unknown]	NA|450aa|down_2|NC_021740.1_633062_634412_-	NA	NA|381aa|down_3|NC_021740.1_634423_635566_-	PRK07824, PRK07824, o-succinylbenzoate--CoA ligase	NA|101aa|down_4|NC_021740.1_635580_635883_-	pfam11829, DUF3349, Protein of unknown function (DUF3349)	NA|93aa|down_5|NC_021740.1_635942_636221_-	NA	NA|418aa|down_6|NC_021740.1_636217_637471_-	COG0306, PitA, Phosphate/sulphate permeases [Inorganic ion transport and metabolism]	NA|129aa|down_7|NC_021740.1_637590_637977_-	cd07264, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|295aa|down_8|NC_021740.1_638039_638924_-	PRK05866, PRK05866, SDR family oxidoreductase	NA|301aa|down_9|NC_021740.1_639019_639922_-	PRK08321, PRK08321, 1,4-dihydroxy-2-naphthoyl-CoA synthase
GCF_000422125.1_ASM42212v1	NC_021740	Mycobacterium tuberculosis EAI5, complete sequence	5	692035-692111	2	CRISPRCasFinder	no	c2c9_V-U4	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Type V-U4	TGAGGTGCGGCGTGAGCGCGGGT	23	0	0	NA	NA	NA	1	1	TypeV-U4	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA,NA	NA|136aa|up_9|NC_021740.1_677930_678338_+	cd18696, PIN_MtVapC26-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC26 and related proteins	NA|229aa|up_8|NC_021740.1_678397_679084_-	pfam10738, Lpp-LpqN, Probable lipoprotein LpqN	NA|878aa|up_7|NC_021740.1_679237_681871_+	COG3537, COG3537, Putative alpha-1,2-mannosidase [Carbohydrate transport and metabolism]	NA|796aa|up_6|NC_021740.1_681893_684281_-	pfam03706, LPG_synthase_TM, Lysylphosphatidylglycerol synthase TM region	NA|241aa|up_5|NC_021740.1_684418_685141_+	COG2186, FadR, Transcriptional regulators [Transcription]	NA|266aa|up_4|NC_021740.1_685137_685935_+	COG0767, Ttg2B, ABC-type transport system involved in resistance to organic solvents, permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|296aa|up_3|NC_021740.1_685936_686824_+	COG0767, Ttg2B, ABC-type transport system involved in resistance to organic solvents, permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|405aa|up_2|NC_021740.1_686829_688044_+	pfam11887, Mce4_CUP1, Cholesterol uptake porter CUP1 of Mce4, putative	NA|344aa|up_1|NC_021740.1_688040_689072_+	COG1463, Ttg2C, ABC-type transport system involved in resistance to organic solvents, periplasmic component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|482aa|up_0|NC_021740.1_689068_690514_+	TIGR00996, Mtu_fam_mce, virulence factor Mce family protein	NA|517aa|down_0|NC_021740.1_693246_694797_+	COG1463, Ttg2C, ABC-type transport system involved in resistance to organic solvents, periplasmic component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|131aa|down_1|NC_021740.1_694848_695241_-	cd18768, PIN_MtVapC4-C5-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC4, VapC5, and related proteins	NA|86aa|down_2|NC_021740.1_695237_695495_-	COG4118, Phd, Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]	NA|412aa|down_3|NC_021740.1_695677_696913_-	COG1373, COG1373, Predicted ATPase (AAA+ superfamily) [General function prediction only]	NA|138aa|down_4|NC_021740.1_697163_697577_-	cd18681, PIN_MtVapC27-VapC40_like, VapC-like PIN domain of Mycobacterium tuberculosis VapC27, and VapC40, and related proteins	NA|79aa|down_5|NC_021740.1_697573_697810_-	COG2002, AbrB, Regulators of stationary/sporulation gene expression [Transcription]	NA|169aa|down_6|NC_021740.1_697913_698420_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|157aa|down_7|NC_021740.1_698533_699004_-	PRK10755, PRK10755, two-component system sensor histidine kinase PmrB	NA|254aa|down_8|NC_021740.1_699047_699809_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|104aa|down_9|NC_021740.1_699865_700177_+	pfam03413, PepSY, Peptidase propeptide and YPEB domain
GCF_000422125.1_ASM42212v1	NC_021740	Mycobacterium tuberculosis EAI5, complete sequence	6	925273-926220	3	CRT	no	csa3	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Type I-A	CGGNGCCGGCGGNNCCGGCGG	21	1	3	926056-926091|926056-926091|926056-926091	NC_021740.1_675137-675102|NC_021740.1_1212398-1212433|NC_021740.1_2417011-2416976	NA	19	19	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA,NA|184aa|down_2|NC_021740.1_930230_930782_-,NA|81aa|down_7|NC_021740.1_936244_936487_+	NA|685aa|up_9|NC_021740.1_912909_914964_-	TIGR00350, Transcriptional_regulator_LytR, cell envelope-related function transcriptional attenuator common domain	NA|390aa|up_8|NC_021740.1_915129_916299_-	TIGR00737, Probable_tRNA-dihydrouridine_synthase, putative TIM-barrel protein, nifR3 family	NA|339aa|up_7|NC_021740.1_916386_917403_-	cd01050, Acyl_ACP_Desat, Acyl ACP desaturase, ferritin-like diiron-binding domain	NA|214aa|up_6|NC_021740.1_917564_918206_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|352aa|up_5|NC_021740.1_918286_919342_+	COG3662, COG3662, Uncharacterized protein conserved in bacteria [Function unknown]	csa3|131aa|up_4|NC_021740.1_919393_919786_-	smart00418, HTH_ARSR, helix_turn_helix, Arsenical Resistance Operon Repressor	NA|141aa|up_3|NC_021740.1_919843_920266_-	COG0590, CumB, Cytosine/adenosine deaminases [Nucleotide transport and metabolism / Translation, ribosomal structure and biogenesis]	NA|97aa|up_2|NC_021740.1_920227_920518_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|302aa|up_1|NC_021740.1_920622_921528_+	COG3315, COG3315, O-Methyltransferase involved in polyketide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism]	NA|272aa|up_0|NC_021740.1_921546_922362_-	TIGR04255, hypothetical_protein, TIGR04255 family protein	NA|883aa|down_0|NC_021740.1_926489_929138_-	pfam00934, PE, PE family	NA|215aa|down_1|NC_021740.1_929605_930250_+	pfam14032, PknH_C, PknH-like extracellular domain	NA|184aa|down_2|NC_021740.1_930230_930782_-	NA	NA|241aa|down_3|NC_021740.1_930862_931585_-	COG4849, COG4849, Predicted nucleotidyltransferase [General function prediction    only]	NA|257aa|down_4|NC_021740.1_933371_934142_+	pfam01427, Peptidase_M15, D-ala-D-ala dipeptidase	NA|271aa|down_5|NC_021740.1_934228_935041_+	pfam13847, Methyltransf_31, Methyltransferase domain	NA|287aa|down_6|NC_021740.1_935108_935969_-	TIGR01250, Proline_iminopeptidase, proline-specific peptidase, Bacillus coagulans-type subfamily	NA|81aa|down_7|NC_021740.1_936244_936487_+	NA	NA|431aa|down_8|NC_021740.1_936763_938056_+	cd17329, MFS_MdtH_MDR_like, Multidrug resistance protein MdtH and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|335aa|down_9|NC_021740.1_938039_939044_+	COG1071, AcoA, Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit [Energy production and conversion]
GCF_000422125.1_ASM42212v1	NC_021740	Mycobacterium tuberculosis EAI5, complete sequence	7	1189965-1190073	3	CRISPRCasFinder	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	GACAGCCAGCCGGCTGACCCGCCGT	25	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA,NA	NA|544aa|up_9|NC_021740.1_1179332_1180964_+	cd12119, ttLC_FACS_AlkK_like, Fatty acyl-CoA synthetases similar to LC-FACS from Thermus thermophiles	NA|355aa|up_8|NC_021740.1_1181039_1182104_+	COG3804, COG3804, Uncharacterized conserved protein related to dihydrodipicolinate reductase [Function unknown]	NA|158aa|up_7|NC_021740.1_1182156_1182630_+	cd07821, PYR_PYL_RCAR_like, Pyrabactin resistance 1 (PYR1), PYR1-like (PYL), regulatory component of abscisic acid receptors (RCARs), and related proteins	NA|288aa|up_6|NC_021740.1_1182663_1183527_+	cd01908, YafJ, Glutamine amidotransferases class-II (Gn-AT)_YafJ-type	NA|286aa|up_5|NC_021740.1_1183531_1184389_+	COG1752, RssA, Predicted esterase of the alpha-beta hydrolase superfamily [General function prediction only]	NA|361aa|up_4|NC_021740.1_1184389_1185472_-	cd07228, Pat_NTE_like_bacteria, Bacterial patatin-like phospholipase domain containing protein 6	NA|140aa|up_3|NC_021740.1_1185552_1185972_-	pfam17301, LpqV, Putative lipoprotein LpqV	NA|189aa|up_2|NC_021740.1_1186083_1186650_+	cd10548, cupin_CDO, cysteine dioxygenase, cupin domain	NA|132aa|up_1|NC_021740.1_1186646_1187042_+	cd01447, Polysulfide_ST, Polysulfide-sulfurtransferase - Rhodanese Homology Domain	NA|668aa|up_0|NC_021740.1_1187069_1189073_-	pfam00934, PE, PE family	NA|588aa|down_0|NC_021740.1_1191158_1192922_-	COG4425, COG4425, Predicted membrane protein [Function unknown]	NA|258aa|down_1|NC_021740.1_1192918_1193692_-	PRK05862, PRK05862, enoyl-CoA hydratase; Provisional	NA|346aa|down_2|NC_021740.1_1193703_1194741_-	PRK05617, PRK05617, 3-hydroxyisobutyryl-CoA hydrolase; Provisional	NA|279aa|down_3|NC_021740.1_1194927_1195764_+	COG4760, COG4760, Predicted membrane protein [Function unknown]	NA|284aa|down_4|NC_021740.1_1195879_1196731_+	pfam18741, MTES_1575, REase_MTES_1575	NA|406aa|down_5|NC_021740.1_1196804_1198022_-	PRK07851, PRK07851, acetyl-CoA C-acetyltransferase	NA|315aa|down_6|NC_021740.1_1198074_1199019_-	cd01836, FeeA_FeeB_like, SGNH_hydrolase subfamily, FeeA, FeeB and similar esterases/lipases	NA|356aa|down_7|NC_021740.1_1199241_1200309_+	COG0657, Aes, Esterase/lipase [Lipid metabolism]	NA|465aa|down_8|NC_021740.1_1200365_1201760_+	TIGR01137, Cystathionine_beta-synthase, cystathionine beta-synthase	NA|241aa|down_9|NC_021740.1_1201961_1202684_+	pfam06271, RDD, RDD family
GCF_000422125.1_ASM42212v1	NC_021740	Mycobacterium tuberculosis EAI5, complete sequence	8	1210599-1211455	4	CRISPRCasFinder	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	GGCGGTGTCGGCGGTGCCGGCGG	23	4	71	1210982-1211003|1211027-1211042|1211165-1211186|1211165-1211186|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300|1211282-1211300	NC_021740.1_2948343-2948322|NC_021740.1_2083200-2083185|NC_021740.1_837446-837467|NC_021740.1_1216293-1216314|NC_021740.1_674801-674783|NC_021740.1_840508-840526|NC_021740.1_1212125-1212143|NC_021740.1_1216266-1216284|NC_021740.1_1216338-1216356|NC_021740.1_1627997-1627979|NC_021740.1_1630823-1630805|NC_021740.1_1986759-1986741|NC_021740.1_2056128-2056110|NC_021740.1_2056575-2056557|NC_021740.1_2056674-2056656|NC_021740.1_2789893-2789875|NC_021740.1_2793421-2793403|NC_021740.1_150187-150205|NC_021740.1_333791-333773|NC_021740.1_335210-335192|NC_021740.1_338012-337994|NC_021740.1_338141-338123|NC_021740.1_338408-338390|NC_021740.1_338437-338455|NC_021740.1_338531-338513|NC_021740.1_363062-363044|NC_021740.1_443438-443420|NC_021740.1_546780-546762|NC_021740.1_623991-624009|NC_021740.1_624213-624231|NC_021740.1_673967-673949|NC_021740.1_840229-840247|NC_021740.1_1089629-1089647|NC_021740.1_1090277-1090295|NC_021740.1_1094335-1094317|NC_021740.1_1211654-1211672|NC_021740.1_1212044-1212062|NC_021740.1_1216686-1216704|NC_021740.1_1487866-1487848|NC_021740.1_1615961-1615943|NC_021740.1_1616414-1616396|NC_021740.1_1633336-1633318|NC_021740.1_1633345-1633327|NC_021740.1_1861622-1861604|NC_021740.1_1862150-1862132|NC_021740.1_1986033-1986015|NC_021740.1_1996040-1996058|NC_021740.1_2083398-2083380|NC_021740.1_2298094-2298076|NC_021740.1_2351541-2351523|NC_021740.1_2412837-2412855|NC_021740.1_2416924-2416906|NC_021740.1_2559703-2559685|NC_021740.1_2682647-2682665|NC_021740.1_2783213-2783195|NC_021740.1_2783795-2783777|NC_021740.1_3041023-3041041|NC_021740.1_3041122-3041140|NC_021740.1_3689223-3689241|NC_021740.1_3719846-3719828|NC_021740.1_3720470-3720452|NC_021740.1_3720713-3720695|NC_021740.1_3720846-3720828|NC_021740.1_3783352-3783370|NC_021740.1_3783535-3783553|NC_021740.1_3783703-3783721|NC_021740.1_3910154-3910172|NC_021740.1_3911884-3911902|NC_021740.1_3920391-3920409|NC_021740.1_4011369-4011351|NC_021740.1_4073895-4073913	NA	16	16	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|89aa|up_3|NC_021740.1_1206031_1206298_+,NA|61aa|down_3|NC_021740.1_1213873_1214056_+	NA|465aa|up_9|NC_021740.1_1200365_1201760_+	TIGR01137, Cystathionine_beta-synthase, cystathionine beta-synthase	NA|241aa|up_8|NC_021740.1_1201961_1202684_+	pfam06271, RDD, RDD family	NA|389aa|up_7|NC_021740.1_1202715_1203882_+	PRK07811, PRK07811, cystathionine gamma-synthase; Provisional	NA|165aa|up_6|NC_021740.1_1203952_1204447_-	PRK00226, greA, transcription elongation factor GreA; Reviewed	NA|145aa|up_5|NC_021740.1_1204632_1205067_-	pfam14155, DUF4307, Domain of unknown function (DUF4307)	NA|289aa|up_4|NC_021740.1_1205168_1206035_+	TIGR03446, mycothiol_Mca, mycothiol conjugate amidase Mca	NA|89aa|up_3|NC_021740.1_1206031_1206298_+	NA	NA|674aa|up_2|NC_021740.1_1206284_1208306_+	COG1331, COG1331, Highly conserved protein containing a thioredoxin domain [Posttranslational modification, protein turnover, chaperones]	NA|243aa|up_1|NC_021740.1_1208404_1209133_-	TIGR01065, Hypothetical_UPF0073_protein_yqfA	NA|263aa|up_0|NC_021740.1_1209243_1210032_+	PRK14828, PRK14828, undecaprenyl pyrophosphate synthase; Provisional	NA|107aa|down_0|NC_021740.1_1212688_1213009_+	COG0020, UppS, Undecaprenyl pyrophosphate synthase [Lipid metabolism]	NA|145aa|down_1|NC_021740.1_1213161_1213596_+	pfam00934, PE, PE family	NA|56aa|down_2|NC_021740.1_1213694_1213862_+	smart00637, CBD_II, CBD_II domain	NA|61aa|down_3|NC_021740.1_1213873_1214056_+	NA	NA|152aa|down_4|NC_021740.1_1214247_1214703_+	pfam01670, Glyco_hydro_12, Glycosyl hydrolase family 12	NA|854aa|down_5|NC_021740.1_1215117_1217679_+	pfam00934, PE, PE family	NA|313aa|down_6|NC_021740.1_1217896_1218835_-	PRK05439, PRK05439, pantothenate kinase; Provisional	NA|427aa|down_7|NC_021740.1_1219222_1220503_+	PRK00011, glyA, serine hydroxymethyltransferase; Reviewed	NA|276aa|down_8|NC_021740.1_1220607_1221435_+	cd01050, Acyl_ACP_Desat, Acyl ACP desaturase, ferritin-like diiron-binding domain	NA|434aa|down_9|NC_021740.1_1221645_1222947_+	COG1875, COG1875, NYN ribonuclease and ATPase of PhoH family domains [General    function prediction only]
GCF_000422125.1_ASM42212v1	NC_021740	Mycobacterium tuberculosis EAI5, complete sequence	9	1569483-1570555	5	CRISPRCasFinder	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	CCGCCGTTGCCGCCGGCACCGCCGTCGCCGAT	32	0	0	NA	NA	NA	15	15	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|106aa|up_7|NC_021740.1_1561246_1561564_+,NA	NA|103aa|up_9|NC_021740.1_1558761_1559070_+	pfam00934, PE, PE family	NA|540aa|up_8|NC_021740.1_1559066_1560686_+	COG5651, COG5651, PPE-repeat proteins [Cell motility and secretion]	NA|106aa|up_7|NC_021740.1_1561246_1561564_+	NA	NA|209aa|up_6|NC_021740.1_1561698_1562325_+	PRK00300, gmk, guanylate kinase; Provisional	NA|111aa|up_5|NC_021740.1_1562390_1562723_+	TIGR00690, DNA-directed_RNA_polymerase_subunit_omega, DNA-directed RNA polymerase, omega subunit	NA|419aa|up_4|NC_021740.1_1562738_1563995_+	PRK05579, PRK05579, bifunctional phosphopantothenoylcysteine decarboxylase/phosphopantothenate synthase; Validated	NA|404aa|up_3|NC_021740.1_1564122_1565334_+	PRK05250, PRK05250, S-adenosylmethionine synthetase; Validated	NA|493aa|up_2|NC_021740.1_1565406_1566885_-	COG2072, TrkA, Predicted flavoprotein involved in K+ transport [Inorganic ion transport and metabolism]	NA|462aa|up_1|NC_021740.1_1566881_1568267_-	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|345aa|up_0|NC_021740.1_1568344_1569379_+	smart00342, HTH_ARAC, helix_turn_helix, arabinose operon control protein	NA|134aa|down_0|NC_021740.1_1571409_1571811_-	cd18741, PIN_VapC4-5_FitB-like, uncharacterized subgroup of the PIN_VapC4-5_FitB-like subfamily of the PIN domain superfamily	NA|86aa|down_1|NC_021740.1_1571807_1572065_-	pfam01402, RHH_1, Ribbon-helix-helix protein, copG family	NA|320aa|down_2|NC_021740.1_1572147_1573107_-	pfam07859, Abhydrolase_3, alpha/beta hydrolase fold	NA|321aa|down_3|NC_021740.1_1573131_1574094_-	pfam07859, Abhydrolase_3, alpha/beta hydrolase fold	NA|201aa|down_4|NC_021740.1_1574227_1574830_+	COG3714, COG3714, Predicted membrane protein [Function unknown]	NA|656aa|down_5|NC_021740.1_1574910_1576878_+	PRK14873, PRK14873, primosomal protein N'	NA|275aa|down_6|NC_021740.1_1576895_1577720_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|161aa|down_7|NC_021740.1_1577888_1578371_+	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|275aa|down_8|NC_021740.1_1578442_1579267_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|313aa|down_9|NC_021740.1_1579463_1580402_+	PRK00005, fmt, methionyl-tRNA formyltransferase; Reviewed
GCF_000422125.1_ASM42212v1	NC_021740	Mycobacterium tuberculosis EAI5, complete sequence	10	1634200-1634388	4	CRT	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	GCCGCCGGCCCCGCCGGC	18	1	2	1634299-1634316|1634299-1634316	NC_021740.1_2789800-2789817|NC_021740.1_3749092-3749109	NA	4	4	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|38aa|up_8|NC_021740.1_1620422_1620536_-,NA|137aa|up_7|NC_021740.1_1620585_1620996_-,NA|288aa|down_2|NC_021740.1_1637978_1638842_+	NA|162aa|up_9|NC_021740.1_1619505_1619991_-	cd07820, SRPBCC_3, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|38aa|up_8|NC_021740.1_1620422_1620536_-	NA	NA|137aa|up_7|NC_021740.1_1620585_1620996_-	NA	NA|248aa|up_6|NC_021740.1_1621012_1621756_-	pfam01182, Glucosamine_iso, Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase	NA|304aa|up_5|NC_021740.1_1621752_1622664_-	TIGR00534, Putative_OxPP_cycle_protein_OpcA, glucose-6-phosphate dehydrogenase assembly protein OpcA	NA|515aa|up_4|NC_021740.1_1622716_1624261_-	PRK05722, PRK05722, glucose-6-phosphate 1-dehydrogenase; Validated	NA|374aa|up_3|NC_021740.1_1624257_1625379_-	PRK03343, PRK03343, transaldolase; Validated	NA|701aa|up_2|NC_021740.1_1625395_1627498_-	COG0021, TktA, Transketolase [Carbohydrate transport and metabolism]	NA|1330aa|up_1|NC_021740.1_1627936_1631926_-	pfam00934, PE, PE family	NA|309aa|up_0|NC_021740.1_1632327_1633254_+	PRK04375, PRK04375, protoheme IX farnesyltransferase; Provisional	NA|422aa|down_0|NC_021740.1_1635679_1636945_+	COG2508, COG2508, Regulator of polyketide synthase expression [Signal transduction mechanisms / Secondary metabolites biosynthesis, transport, and catabolism]	NA|329aa|down_1|NC_021740.1_1636972_1637959_-	cd05286, QOR2, Quinone oxidoreductase (QOR)	NA|288aa|down_2|NC_021740.1_1637978_1638842_+	NA	NA|311aa|down_3|NC_021740.1_1638791_1639724_-	COG1612, CtaA, Uncharacterized protein required for cytochrome oxidase assembly [Posttranslational modification, protein turnover, chaperones]	NA|262aa|down_4|NC_021740.1_1639835_1640621_-	TIGR00025, Mtu_efflux, ABC transporter efflux protein, DrrB family	NA|314aa|down_5|NC_021740.1_1640617_1641559_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|592aa|down_6|NC_021740.1_1641661_1643437_-	TIGR03459, crt_membr, carotene biosynthesis associated membrane protein	NA|269aa|down_7|NC_021740.1_1643484_1644291_+	COG2345, COG2345, Predicted transcriptional regulator [Transcription]	NA|847aa|down_8|NC_021740.1_1644287_1646828_+	TIGR01980, UPF0051_protein_slr0074, FeS assembly protein SufB	NA|398aa|down_9|NC_021740.1_1646824_1648018_+	COG0719, SufB, Cysteine desulfurase activator SufB [Posttranslational modification, protein turnover, chaperones]
GCF_000422125.1_ASM42212v1	NC_021740	Mycobacterium tuberculosis EAI5, complete sequence	11	2082870-2083124	5	CRT	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	GCCNCCGTCGCCGCCNNTGCC	21	2	3	2082951-2082968|2082951-2082968|2083041-2083058	NC_021740.1_401087-401070|NC_021740.1_607446-607429|NC_021740.1_3436642-3436625	NA	5	5	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|88aa|up_0|NC_021740.1_2082229_2082493_-,NA	NA|165aa|up_9|NC_021740.1_2068523_2069018_+	pfam02577, DNase-RNase, Bifunctional nuclease	NA|226aa|up_8|NC_021740.1_2069421_2070099_+	cd01105, HTH_GlnR-like, Helix-Turn-Helix DNA binding domain of GlnR-like transcription regulators	NA|942aa|up_7|NC_021740.1_2070457_2073283_+	PRK05367, PRK05367, aminomethyl-transferring glycine dehydrogenase	NA|287aa|up_6|NC_021740.1_2073509_2074370_-	PRK03204, PRK03204, haloalkane dehalogenase; Provisional	NA|289aa|up_5|NC_021740.1_2074410_2075277_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|629aa|up_4|NC_021740.1_2075281_2077168_-	TIGR00976, Hypothetical_protein_Rv1835c/MT1883/Mb1866c	NA|678aa|up_3|NC_021740.1_2077183_2079217_-	cd01456, vWA_ywmD_type, VWA ywmD type:Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)	NA|742aa|up_2|NC_021740.1_2079336_2081562_-	PRK02999, PRK02999, malate synthase G; Provisional	NA|132aa|up_1|NC_021740.1_2081837_2082233_-	COG1848, COG1848, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|88aa|up_0|NC_021740.1_2082229_2082493_-	NA	NA|346aa|down_0|NC_021740.1_2084261_2085299_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|456aa|down_1|NC_021740.1_2085298_2086666_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|479aa|down_2|NC_021740.1_2086839_2088276_-	PRK07807, PRK07807, GuaB1 family IMP dehydrogenase-related protein	NA|486aa|down_3|NC_021740.1_2088311_2089769_-	PRK09287, PRK09287, NADP-dependent phosphogluconate dehydrogenase	NA|317aa|down_4|NC_021740.1_2089798_2090749_-	cd07326, M56_BlaR1_MecR1_like, Peptidase M56-like including those in BlaR1 and MecR1, integral membrane metallopeptidase	NA|139aa|down_5|NC_021740.1_2090763_2091180_-	COG3682, COG3682, Predicted transcriptional regulator [Transcription]	NA|141aa|down_6|NC_021740.1_2091457_2091880_+	cd03443, PaaI_thioesterase, PaaI_thioesterase is a tetrameric acyl-CoA thioesterase with a hot dog fold and one of several proteins responsible for phenylacetic acid (PA) degradation in bacteria	NA|101aa|down_7|NC_021740.1_2091928_2092231_+	pfam00547, Urease_gamma, Urease, gamma subunit	NA|105aa|down_8|NC_021740.1_2092227_2092542_+	PRK13202, ureB, urease subunit beta; Reviewed	NA|578aa|down_9|NC_021740.1_2092541_2094275_+	PRK13206, ureC, urease subunit alpha; Reviewed
GCF_000422125.1_ASM42212v1	NC_021740	Mycobacterium tuberculosis EAI5, complete sequence	12	2163397-2163616	6	CRISPRCasFinder	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	GTGCCAGCCGGAATCGTGATCGGCGGAACCGTCACCGACGGAATACTCA	49	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|157aa|up_9|NC_021740.1_2147007_2147478_-,NA|136aa|up_2|NC_021740.1_2154502_2154910_-,NA|127aa|down_5|NC_021740.1_2171132_2171513_-	NA|157aa|up_9|NC_021740.1_2147007_2147478_-	NA	NA|216aa|up_8|NC_021740.1_2147817_2148465_-	pfam14081, DUF4262, Domain of unknown function (DUF4262)	NA|741aa|up_7|NC_021740.1_2148471_2150694_-	PRK15061, PRK15061, catalase/peroxidase	NA|148aa|up_6|NC_021740.1_2150731_2151175_-	COG0735, Fur, Fe2+/Zn2+ uptake regulation proteins [Inorganic ion transport and metabolism]	NA|198aa|up_5|NC_021740.1_2151288_2151882_-	COG1881, COG1881, Phospholipid-binding protein [General function prediction only]	NA|202aa|up_4|NC_021740.1_2151964_2152570_-	COG1881, COG1881, Phospholipid-binding protein [General function prediction only]	NA|251aa|up_3|NC_021740.1_2153772_2154525_+	cd16282, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	NA|136aa|up_2|NC_021740.1_2154502_2154910_-	NA	NA|767aa|up_1|NC_021740.1_2155044_2157345_+	PLN02892, PLN02892, isocitrate lyase	NA|1460aa|up_0|NC_021740.1_2157514_2161894_-	pfam00823, PPE, PPE family	NA|155aa|down_0|NC_021740.1_2165643_2166108_-	cd07821, PYR_PYL_RCAR_like, Pyrabactin resistance 1 (PYR1), PYR1-like (PYL), regulatory component of abscisic acid receptors (RCARs), and related proteins	NA|288aa|down_1|NC_021740.1_2166205_2167069_+	cd07987, LPLAT_MGAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: MGAT-like	NA|424aa|down_2|NC_021740.1_2167106_2168378_-	TIGR01490, Uncharacterized_protein_Rv3661/MT3761, HAD-superfamily subfamily IB hydrolase, TIGR01490	NA|372aa|down_3|NC_021740.1_2168649_2169765_+	COG1680, AmpC, Beta-lactamase class C and other penicillin binding proteins [Defense mechanisms]	NA|447aa|down_4|NC_021740.1_2169755_2171096_+	pfam00144, Beta-lactamase, Beta-lactamase	NA|127aa|down_5|NC_021740.1_2171132_2171513_-	NA	NA|621aa|down_6|NC_021740.1_2171669_2173532_+	PRK12476, PRK12476, putative fatty-acid--CoA ligase; Provisional	NA|160aa|down_7|NC_021740.1_2173539_2174019_-	pfam09167, DUF1942, Domain of unknown function (DUF1942)	NA|258aa|down_8|NC_021740.1_2174255_2175029_+	COG3361, COG3361, Uncharacterized conserved protein [Function unknown]	NA|256aa|down_9|NC_021740.1_2175032_2175800_-	PRK05867, PRK05867, SDR family oxidoreductase
GCF_000422125.1_ASM42212v1	NC_021740	Mycobacterium tuberculosis EAI5, complete sequence	13	3105615-3106969	2,7,6	PILER-CR,CRISPRCasFinder,CRT	no	c2c9_V-U4,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Type III-A,Type III-C,Type III-B,Type III-D	GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC,GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC,GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC	36,36,36	0	0	NA	NA	II-B,III-A:II-B,III-A:II-B,III-A	17,17,18	18	TypeIII-A,TypeIII-C,TypeIII-B,TypeIII-D	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|85aa|up_8|NC_021740.1_3098894_3099149_-,NA|135aa|up_7|NC_021740.1_3099296_3099701_+,NA|64aa|up_6|NC_021740.1_3099697_3099889_+,NA|86aa|up_4|NC_021740.1_3101475_3101733_+,NA|104aa|up_3|NC_021740.1_3101837_3102149_+,NA|203aa|up_2|NC_021740.1_3102568_3103177_+,NA	NA|92aa|up_9|NC_021740.1_3098443_3098719_+	COG4453, COG4453, Uncharacterized protein conserved in bacteria [Function unknown]	NA|85aa|up_8|NC_021740.1_3098894_3099149_-	NA	NA|135aa|up_7|NC_021740.1_3099296_3099701_+	NA	NA|64aa|up_6|NC_021740.1_3099697_3099889_+	NA	NA|385aa|up_5|NC_021740.1_3100087_3101242_+	pfam00665, rve, Integrase core domain	NA|86aa|up_4|NC_021740.1_3101475_3101733_+	NA	NA|104aa|up_3|NC_021740.1_3101837_3102149_+	NA	NA|203aa|up_2|NC_021740.1_3102568_3103177_+	NA	NA|470aa|up_1|NC_021740.1_3103247_3104657_+	pfam00665, rve, Integrase core domain	NA|271aa|up_0|NC_021740.1_3104653_3105466_+	COG3267, ExeA, Type II secretory pathway, component ExeA (predicted ATPase) [Intracellular trafficking and secretion]	NA|421aa|down_0|NC_021740.1_3106995_3108257_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	cas2|114aa|down_1|NC_021740.1_3110054_3110396_-	COG1343, COG1343, CRISPR-associated protein Cas2 [Defense mechanisms]	cas1|339aa|down_2|NC_021740.1_3110396_3111413_-	TIGR00287, CRISPR-associated_endonuclease_Cas1, CRISPR-associated endonuclease Cas1	csm6|383aa|down_3|NC_021740.1_3111425_3112574_-	cd09699, Csm6_III-A, CRISPR/Cas system-associated protein Csm6	csm5gr7|376aa|down_4|NC_021740.1_3112669_3113797_-	COG1332, COG1332, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	csm4gr5|303aa|down_5|NC_021740.1_3113793_3114702_-	COG1567, COG1567, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csm3gr7|237aa|down_6|NC_021740.1_3114682_3115393_-	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm2gr11|125aa|down_7|NC_021740.1_3115402_3115777_-	TIGR01870, CRISPR_type_III-associated_protein_Csm2, CRISPR type III-A/MTUBE-associated protein Csm2	cas10|810aa|down_8|NC_021740.1_3115773_3118203_-	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	cas6|241aa|down_9|NC_021740.1_3118199_3118922_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]
GCF_000422125.1_ASM42212v1	NC_021740	Mycobacterium tuberculosis EAI5, complete sequence	14	3108292-3110006	8,7,3	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Type III-A,Type III-C,Type III-B,Type III-D	GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC,GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC,GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC	36,36,36	0	0	NA	NA	II-B,III-A:II-B,III-A:II-B,III-A	23,23,22	23	TypeIII-A,TypeIII-C,TypeIII-B,TypeIII-D	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|85aa|up_9|NC_021740.1_3098894_3099149_-,NA|135aa|up_8|NC_021740.1_3099296_3099701_+,NA|64aa|up_7|NC_021740.1_3099697_3099889_+,NA|86aa|up_5|NC_021740.1_3101475_3101733_+,NA|104aa|up_4|NC_021740.1_3101837_3102149_+,NA|203aa|up_3|NC_021740.1_3102568_3103177_+,NA	NA|85aa|up_9|NC_021740.1_3098894_3099149_-	NA	NA|135aa|up_8|NC_021740.1_3099296_3099701_+	NA	NA|64aa|up_7|NC_021740.1_3099697_3099889_+	NA	NA|385aa|up_6|NC_021740.1_3100087_3101242_+	pfam00665, rve, Integrase core domain	NA|86aa|up_5|NC_021740.1_3101475_3101733_+	NA	NA|104aa|up_4|NC_021740.1_3101837_3102149_+	NA	NA|203aa|up_3|NC_021740.1_3102568_3103177_+	NA	NA|470aa|up_2|NC_021740.1_3103247_3104657_+	pfam00665, rve, Integrase core domain	NA|271aa|up_1|NC_021740.1_3104653_3105466_+	COG3267, ExeA, Type II secretory pathway, component ExeA (predicted ATPase) [Intracellular trafficking and secretion]	NA|421aa|up_0|NC_021740.1_3106995_3108257_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	cas2|114aa|down_0|NC_021740.1_3110054_3110396_-	COG1343, COG1343, CRISPR-associated protein Cas2 [Defense mechanisms]	cas1|339aa|down_1|NC_021740.1_3110396_3111413_-	TIGR00287, CRISPR-associated_endonuclease_Cas1, CRISPR-associated endonuclease Cas1	csm6|383aa|down_2|NC_021740.1_3111425_3112574_-	cd09699, Csm6_III-A, CRISPR/Cas system-associated protein Csm6	csm5gr7|376aa|down_3|NC_021740.1_3112669_3113797_-	COG1332, COG1332, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	csm4gr5|303aa|down_4|NC_021740.1_3113793_3114702_-	COG1567, COG1567, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csm3gr7|237aa|down_5|NC_021740.1_3114682_3115393_-	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm2gr11|125aa|down_6|NC_021740.1_3115402_3115777_-	TIGR01870, CRISPR_type_III-associated_protein_Csm2, CRISPR type III-A/MTUBE-associated protein Csm2	cas10|810aa|down_7|NC_021740.1_3115773_3118203_-	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	cas6|241aa|down_8|NC_021740.1_3118199_3118922_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|182aa|down_9|NC_021740.1_3119321_3119867_-	COG4293, COG4293, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_000422125.1_ASM42212v1	NC_021740	Mycobacterium tuberculosis EAI5, complete sequence	15	3723040-3723581	8	CRT	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	CCGCCGTTNCCNCCGTTGCCGCC	23	0	0	NA	NA	NA	7	7	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA,NA|96aa|down_4|NC_021740.1_3737681_3737969_-,NA|88aa|down_9|NC_021740.1_3751089_3751353_-	NA|282aa|up_9|NC_021740.1_3702290_3703136_-	pfam05305, DUF732, Protein of unknown function (DUF732)	NA|147aa|up_8|NC_021740.1_3703610_3704051_+	cd04770, HTH_HMRTR, Helix-Turn-Helix DNA binding domain of Heavy Metal Resistance transcription regulators	NA|290aa|up_7|NC_021740.1_3704084_3704954_-	TIGR00766, Uncharacterized_protein_Dda3937_02003, inner membrane protein YhjD	NA|337aa|up_6|NC_021740.1_3704974_3705985_-	PRK00927, PRK00927, tryptophanyl-tRNA synthetase; Reviewed	NA|274aa|up_5|NC_021740.1_3706081_3706903_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|410aa|up_4|NC_021740.1_3706969_3708199_-	PRK08299, PRK08299, NADP-dependent isocitrate dehydrogenase	NA|450aa|up_3|NC_021740.1_3708481_3709831_+	PRK07812, PRK07812, O-acetylhomoserine aminocarboxypropyltransferase; Validated	NA|380aa|up_2|NC_021740.1_3709842_3710982_+	PRK00175, metX, homoserine O-acetyltransferase; Provisional	NA|244aa|up_1|NC_021740.1_3710978_3711710_+	pfam08241, Methyltransf_11, Methyltransferase domain	NA|2524aa|up_0|NC_021740.1_3711718_3719290_-	pfam00823, PPE, PPE family	NA|86aa|down_0|NC_021740.1_3725552_3725810_-	pfam11222, DUF3017, Protein of unknown function (DUF3017)	NA|3158aa|down_1|NC_021740.1_3726065_3735539_-	pfam00823, PPE, PPE family	NA|149aa|down_2|NC_021740.1_3736163_3736610_+	pfam01548, DEDD_Tnp_IS110, Transposase	NA|247aa|down_3|NC_021740.1_3736646_3737387_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|96aa|down_4|NC_021740.1_3737681_3737969_-	NA	NA|3717aa|down_5|NC_021740.1_3738305_3749456_-	pfam00823, PPE, PPE family	NA|265aa|down_6|NC_021740.1_3749699_3750494_-	pfam08031, BBE, Berberine and berberine like	NA|124aa|down_7|NC_021740.1_3750575_3750947_-	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|73aa|down_8|NC_021740.1_3750844_3751063_-	pfam01565, FAD_binding_4, FAD binding domain	NA|88aa|down_9|NC_021740.1_3751089_3751353_-	NA
GCF_000422125.1_ASM42212v1	NC_021740	Mycobacterium tuberculosis EAI5, complete sequence	16	3927999-3929750	9	CRT	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	CANCGGCGGCACCGGCGGCNNCGGCGGNNNCGGCGG	36	2	9	3928590-3928625|3928590-3928625|3928590-3928625|3928590-3928625|3928590-3928625|3929481-3929507|3929481-3929507|3929481-3929507|3929481-3929507	NC_021740.1_3926706-3926741|NC_021740.1_3927048-3927083|NC_021740.1_3927639-3927674|NC_021740.1_3927990-3928025|NC_021740.1_3912439-3912474|NC_021740.1_362002-362028|NC_021740.1_3920892-3920918|NC_021740.1_3921228-3921254|NC_021740.1_3921574-3921600	NA	21	21	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|134aa|up_2|NC_021740.1_3922732_3923134_-,NA|384aa|up_1|NC_021740.1_3923446_3924598_+,NA|280aa|down_2|NC_021740.1_3933065_3933905_+	NA|503aa|up_9|NC_021740.1_3904524_3906033_+	PRK07867, PRK07867, acyl-CoA synthetase; Validated	NA|1382aa|up_8|NC_021740.1_3906203_3910349_+	pfam00934, PE, PE family	NA|1902aa|up_7|NC_021740.1_3910639_3916345_+	pfam00934, PE, PE family	NA|516aa|up_6|NC_021740.1_3916511_3918059_-	PRK07586, PRK07586, acetolactate synthase large subunit	NA|279aa|up_5|NC_021740.1_3918055_3918892_-	COG2159, COG2159, Predicted metal-dependent hydrolase of the TIM-barrel fold [General function prediction only]	NA|715aa|up_4|NC_021740.1_3919251_3921396_+	pfam00934, PE, PE family	NA|446aa|up_3|NC_021740.1_3921355_3922693_-	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|134aa|up_2|NC_021740.1_3922732_3923134_-	NA	NA|384aa|up_1|NC_021740.1_3923446_3924598_+	NA	NA|219aa|up_0|NC_021740.1_3924726_3925383_-	PRK07798, PRK07798, acyl-CoA synthetase; Validated	NA|549aa|down_0|NC_021740.1_3930458_3932105_-	PRK07798, PRK07798, acyl-CoA synthetase; Validated	NA|264aa|down_1|NC_021740.1_3932178_3932970_+	PRK07799, PRK07799, crotonase/enoyl-CoA hydratase family protein	NA|280aa|down_2|NC_021740.1_3933065_3933905_+	NA	NA|399aa|down_3|NC_021740.1_3933959_3935156_-	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|237aa|down_4|NC_021740.1_3935184_3935895_+	pfam06314, ADC, Acetoacetate decarboxylase (ADC)	NA|348aa|down_5|NC_021740.1_3935959_3937003_-	TIGR03559, F420_Rv3520c, probable F420-dependent oxidoreductase, Rv3520c family	NA|304aa|down_6|NC_021740.1_3937155_3938067_+	COG1545, COG1545, Predicted nucleic-acid-binding protein containing a Zn-ribbon [General function prediction only]	NA|355aa|down_7|NC_021740.1_3938082_3939147_+	PRK07937, PRK07937, lipid-transfer protein; Provisional	NA|395aa|down_8|NC_021740.1_3939163_3940348_+	PRK08313, PRK08313, thiolase domain-containing protein	NA|344aa|down_9|NC_021740.1_3940389_3941421_+	cd14952, NHL_PKND_like, NHL repeat domain of the protein kinase PknD
GCF_000422125.1_ASM42212v1	NC_021740	Mycobacterium tuberculosis EAI5, complete sequence	17	4090312-4090400	9	CRISPRCasFinder	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	GCTCGGCGACGATGCGGGCCGGATGACGGCC	31	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|257aa|up_6|NC_021740.1_4080898_4081669_-,NA|233aa|up_0|NC_021740.1_4089416_4090115_-,NA|126aa|down_6|NC_021740.1_4095635_4096013_+	NA|388aa|up_9|NC_021740.1_4076569_4077733_-	TIGR03819, heli_sec_ATPase, helicase/secretion neighborhood ATPase	NA|351aa|up_8|NC_021740.1_4077729_4078782_-	TIGR03815, CpaE_hom_Actino, helicase/secretion neighborhood CpaE-like protein	NA|288aa|up_7|NC_021740.1_4079280_4080144_+	TIGR01490, Uncharacterized_protein_Rv3661/MT3761, HAD-superfamily subfamily IB hydrolase, TIGR01490	NA|257aa|up_6|NC_021740.1_4080898_4081669_-	NA	NA|549aa|up_5|NC_021740.1_4081665_4083312_-	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]	NA|288aa|up_4|NC_021740.1_4083308_4084172_-	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|309aa|up_3|NC_021740.1_4084164_4085091_-	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|542aa|up_2|NC_021740.1_4085092_4086718_-	cd00995, PBP2_NikA_DppA_OppA_like, The substrate-binding domain of an ABC-type nickel/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|652aa|up_1|NC_021740.1_4087425_4089381_+	PRK00174, PRK00174, acetyl-CoA synthetase; Provisional	NA|233aa|up_0|NC_021740.1_4089416_4090115_-	NA	NA|173aa|down_0|NC_021740.1_4090460_4090979_+	pfam07332, Phage_holin_3_6, Putative Actinobacterial Holin-X, holin superfamily III	NA|328aa|down_1|NC_021740.1_4090979_4091963_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|398aa|down_2|NC_021740.1_4091955_4093149_-	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|274aa|down_3|NC_021740.1_4093154_4093976_-	cd03426, CoAse, Coenzyme A pyrophosphatase (CoAse), a member of the Nudix hydrolase superfamily, functions to catalyze the elimination of oxidized inactive CoA, which can inhibit CoA-utilizing enzymes	NA|228aa|down_4|NC_021740.1_4094107_4094791_-	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|246aa|down_5|NC_021740.1_4094790_4095528_-	COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]	NA|126aa|down_6|NC_021740.1_4095635_4096013_+	NA	NA|225aa|down_7|NC_021740.1_4096111_4096786_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|265aa|down_8|NC_021740.1_4096891_4097686_-	cd16278, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	NA|152aa|down_9|NC_021740.1_4097692_4098148_-	cd02199, YjgF_YER057c_UK114_like_1, This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function
