assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000350205.1_ASM35020v1	NC_020559	Mycobacterium tuberculosis str. Erdman = ATCC 35801 chromosome 1, complete sequence	1	334145-334256	1	PILER-CR	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	CCGCCG---------TTGCCGCCGTTGCCGATCA	34	1	6	334218-334234|334218-334234|334218-334234|334218-334234|334218-334234|334218-334234	NC_020559.1_840558-840542|NC_020559.1_1188645-1188661|NC_020559.1_1211982-1211966|NC_020559.1_2416485-2416501|NC_020559.1_2792776-2792792|NC_020559.1_4018008-4018024	NA	2	2	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|61aa|up_0|NC_020559.1_332475_332658_-,NA	NA|398aa|up_9|NC_020559.1_322653_323847_-	COG3285, COG3285, Predicted eukaryotic-type DNA primase [DNA replication, recombination, and repair]	NA|561aa|up_8|NC_020559.1_323882_325565_+	PRK07788, PRK07788, acyl-CoA synthetase; Validated	NA|732aa|up_7|NC_020559.1_325581_327777_-	cd01152, ACAD_fadE6_17_26, Putative acyl-CoA dehydrogenases similar to fadE6, fadE17, and fadE26	NA|378aa|up_6|NC_020559.1_327890_329024_-	pfam12146, Hydrolase_4, Serine aminopeptidase, S33	NA|207aa|up_5|NC_020559.1_329020_329641_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|194aa|up_4|NC_020559.1_329737_330319_+	pfam00903, Glyoxalase, Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily	NA|242aa|up_3|NC_020559.1_330248_330974_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|307aa|up_2|NC_020559.1_331063_331984_+	COG3662, COG3662, Uncharacterized protein conserved in bacteria [Function unknown]	NA|143aa|up_1|NC_020559.1_332023_332452_-	cd18678, PIN_MtVapC25_VapC33-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC25, VapC33, and related proteins	NA|61aa|up_0|NC_020559.1_332475_332658_-	NA	NA|900aa|down_0|NC_020559.1_336753_339453_-	pfam00934, PE, PE family	NA|537aa|down_1|NC_020559.1_339743_341354_+	COG5651, COG5651, PPE-repeat proteins [Cell motility and secretion]	NA|303aa|down_2|NC_020559.1_341377_342286_+	COG3315, COG3315, O-Methyltransferase involved in polyketide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism]	NA|632aa|down_3|NC_020559.1_342509_344405_+	TIGR03922, T7SS_EccA, type VII secretion AAA-ATPase EccA	NA|539aa|down_4|NC_020559.1_344401_346018_+	pfam05108, T7SS_ESX1_EccB, Type VII secretion system ESX-1, transport TM domain B	NA|1331aa|down_5|NC_020559.1_346014_350007_+	TIGR03924, T7SS_EccC_a, type VII secretion protein EccCa	NA|103aa|down_6|NC_020559.1_350003_350312_+	pfam00934, PE, PE family	NA|514aa|down_7|NC_020559.1_350314_351856_+	COG5651, COG5651, PPE-repeat proteins [Cell motility and secretion]	NA|98aa|down_8|NC_020559.1_351904_352198_+	TIGR03930, WXG100_ESAT6, WXG100 family type VII secretion target	NA|97aa|down_9|NC_020559.1_352227_352518_+	COG4842, COG4842, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_000350205.1_ASM35020v1	NC_020559	Mycobacterium tuberculosis str. Erdman = ATCC 35801 chromosome 1, complete sequence	2	366833-367531	1	CRISPRCasFinder	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	TTCGCGAAGCCGATGTTGTAGCTGCCGGTGTTG	33	2	2	367121-367162|367196-367228	NC_020559.1_375152-375193|NC_020559.1_373967-373999	NA	10	10	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|101aa|up_4|NC_020559.1_363846_364149_+,NA|161aa|down_1|NC_020559.1_376941_377424_-,NA|410aa|down_5|NC_020559.1_379540_380770_+	NA|401aa|up_9|NC_020559.1_357233_358436_-	pfam00561, Abhydrolase_1, alpha/beta hydrolase fold	NA|262aa|up_8|NC_020559.1_358542_359328_+	PRK14103, PRK14103, trans-aconitate 2-methyltransferase; Provisional	NA|268aa|up_7|NC_020559.1_359316_360120_-	COG4424, COG4424, Uncharacterized protein conserved in bacteria [Function unknown]	NA|466aa|up_6|NC_020559.1_360129_361527_-	cd16027, SGSH, N-sulfoglucosamine sulfohydrolase (SGSH; sulfamidase)	NA|76aa|up_5|NC_020559.1_363622_363850_+	pfam01402, RHH_1, Ribbon-helix-helix protein, copG family	NA|101aa|up_4|NC_020559.1_363846_364149_+	NA	NA|74aa|up_3|NC_020559.1_364196_364418_+	PHA01748, PHA01748, hypothetical protein	NA|142aa|up_2|NC_020559.1_364414_364840_+	cd18755, PIN_MtVapC3_VapC21-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC3, VapC21 and related proteins	NA|193aa|up_1|NC_020559.1_365028_365607_+	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|303aa|up_0|NC_020559.1_365603_366512_+	cd09810, LPOR_like_SDR_c_like, light-dependent protochlorophyllide reductase (LPOR)-like, classical (c)-like SDRs	NA|224aa|down_0|NC_020559.1_376282_376954_+	TIGR02476, BluB, 5,6-dimethylbenzimidazole synthase	NA|161aa|down_1|NC_020559.1_376941_377424_-	NA	NA|239aa|down_2|NC_020559.1_377481_378198_+	cd03392, PAP2_like_2, PAP2_like_2 proteins	NA|219aa|down_3|NC_020559.1_378299_378956_+	COG3786, COG3786, Uncharacterized protein conserved in bacteria [Function unknown]	NA|164aa|down_4|NC_020559.1_379025_379517_-	pfam13577, SnoaL_4, SnoaL-like domain	NA|410aa|down_5|NC_020559.1_379540_380770_+	NA	NA|621aa|down_6|NC_020559.1_380924_382787_+	COG0443, DnaK, Molecular chaperone [Posttranslational modification, protein turnover, chaperones]	NA|129aa|down_7|NC_020559.1_382858_383245_+	COG0326, HtpG, Molecular chaperone, HSP90 family [Posttranslational modification, protein turnover, chaperones]	NA|212aa|down_8|NC_020559.1_383247_383883_-	pfam11259, DUF3060, Protein of unknown function (DUF3060)	NA|295aa|down_9|NC_020559.1_383970_384855_+	COG2273, SKN1, Beta-glucanase/Beta-glucan synthetase [Carbohydrate transport and metabolism]
GCF_000350205.1_ASM35020v1	NC_020559	Mycobacterium tuberculosis str. Erdman = ATCC 35801 chromosome 1, complete sequence	3	692990-693066	2	CRISPRCasFinder	no	c2c9_V-U4	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Type V-U4	TGAGGTGCGGCGTGAGCGCGGGT	23	0	0	NA	NA	NA	1	1	TypeV-U4	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA,NA	NA|136aa|up_9|NC_020559.1_678886_679294_+	cd18696, PIN_MtVapC26-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC26 and related proteins	NA|229aa|up_8|NC_020559.1_679353_680040_-	pfam10738, Lpp-LpqN, Probable lipoprotein LpqN	NA|878aa|up_7|NC_020559.1_680193_682827_+	COG3537, COG3537, Putative alpha-1,2-mannosidase [Carbohydrate transport and metabolism]	NA|796aa|up_6|NC_020559.1_682849_685237_-	pfam03706, LPG_synthase_TM, Lysylphosphatidylglycerol synthase TM region	NA|241aa|up_5|NC_020559.1_685373_686096_+	COG2186, FadR, Transcriptional regulators [Transcription]	NA|266aa|up_4|NC_020559.1_686092_686890_+	COG0767, Ttg2B, ABC-type transport system involved in resistance to organic solvents, permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|296aa|up_3|NC_020559.1_686891_687779_+	COG0767, Ttg2B, ABC-type transport system involved in resistance to organic solvents, permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|405aa|up_2|NC_020559.1_687784_688999_+	pfam11887, Mce4_CUP1, Cholesterol uptake porter CUP1 of Mce4, putative	NA|482aa|up_1|NC_020559.1_690022_691468_+	TIGR00996, Mtu_fam_mce, virulence factor Mce family protein	NA|479aa|up_0|NC_020559.1_691464_692901_+	TIGR00996, Mtu_fam_mce, virulence factor Mce family protein	NA|517aa|down_0|NC_020559.1_694201_695752_+	COG1463, Ttg2C, ABC-type transport system involved in resistance to organic solvents, periplasmic component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|131aa|down_1|NC_020559.1_695803_696196_-	cd18768, PIN_MtVapC4-C5-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC4, VapC5, and related proteins	NA|86aa|down_2|NC_020559.1_696192_696450_-	COG4118, Phd, Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]	NA|412aa|down_3|NC_020559.1_696632_697868_-	COG1373, COG1373, Predicted ATPase (AAA+ superfamily) [General function prediction only]	NA|138aa|down_4|NC_020559.1_698118_698532_-	cd18681, PIN_MtVapC27-VapC40_like, VapC-like PIN domain of Mycobacterium tuberculosis VapC27, and VapC40, and related proteins	NA|79aa|down_5|NC_020559.1_698528_698765_-	COG2002, AbrB, Regulators of stationary/sporulation gene expression [Transcription]	NA|169aa|down_6|NC_020559.1_698868_699375_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|157aa|down_7|NC_020559.1_699488_699959_-	PRK10755, PRK10755, two-component system sensor histidine kinase PmrB	NA|254aa|down_8|NC_020559.1_700002_700764_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|104aa|down_9|NC_020559.1_700820_701132_+	pfam03413, PepSY, Peptidase propeptide and YPEB domain
GCF_000350205.1_ASM35020v1	NC_020559	Mycobacterium tuberculosis str. Erdman = ATCC 35801 chromosome 1, complete sequence	4	962068-962319	2	PILER-CR	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	ACCTGGCGCCACCCGCGCCCGCCGAACTGGCGCCACCCGCGCCCGCCGACCTGGCACCACCCG	63	1	1	962241-962296	NC_020559.1_962369-962424	NA	2	2	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|67aa|up_3|NC_020559.1_960166_960367_+,NA	NA|158aa|up_9|NC_020559.1_950690_951164_+	cd07819, SRPBCC_2, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|398aa|up_8|NC_020559.1_951160_952354_-	PRK07777, PRK07777, putative succinyldiaminopimelate transaminase DapC	NA|404aa|up_7|NC_020559.1_952510_953722_+	PRK08242, PRK08242, acetyl-CoA C-acetyltransferase	NA|721aa|up_6|NC_020559.1_953726_955889_+	PRK11730, fadB, fatty acid oxidation complex subunit alpha FadB	NA|543aa|up_5|NC_020559.1_955956_957585_-	COG1061, SSL2, DNA or RNA helicases of superfamily II [Transcription / DNA replication, recombination, and repair]	NA|757aa|up_4|NC_020559.1_957828_960099_-	pfam13625, Helicase_C_3, Helicase conserved C-terminal domain	NA|67aa|up_3|NC_020559.1_960166_960367_+	NA	NA|168aa|up_2|NC_020559.1_960376_960880_+	PRK09364, moaC, cyclic pyranopterin monophosphate synthase MoaC	NA|161aa|up_1|NC_020559.1_960876_961359_+	cd00886, MogA_MoaB, MogA_MoaB family	NA|142aa|up_0|NC_020559.1_961355_961781_+	COG0314, MoaE, Molybdopterin converting factor, large subunit [Coenzyme metabolism]	NA|93aa|down_0|NC_020559.1_963469_963748_-	cd00754, Ubl_MoaD, ubiquitin-like (Ubl) domain found in molybdenum cofactor biosynthesis protein D (MoaD) and similar proteins	NA|361aa|down_1|NC_020559.1_963751_964834_-	PRK00164, moaA, GTP 3',8-cyclase MoaA	NA|130aa|down_2|NC_020559.1_964830_965220_-	PRK11770, PRK11770, YccF domain-containing protein	NA|136aa|down_3|NC_020559.1_965384_965792_+	COG1278, CspC, Cold shock proteins [Transcription]	NA|610aa|down_4|NC_020559.1_965910_967740_-	pfam00934, PE, PE family	NA|651aa|down_5|NC_020559.1_968000_969953_+	cd01161, VLCAD, Very long chain acyl-CoA dehydrogenase	NA|387aa|down_6|NC_020559.1_970041_971202_-	COG4398, COG4398, Uncharacterized protein conserved in bacteria [Function unknown]	NA|163aa|down_7|NC_020559.1_971301_971790_-	pfam10969, DUF2771, Protein of unknown function (DUF2771)	NA|549aa|down_8|NC_020559.1_971786_973433_-	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|263aa|down_9|NC_020559.1_973570_974359_+	pfam11228, DUF3027, Protein of unknown function (DUF3027)
GCF_000350205.1_ASM35020v1	NC_020559	Mycobacterium tuberculosis str. Erdman = ATCC 35801 chromosome 1, complete sequence	5	1090695-1090856	3	PILER-CR	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	CGGTGCCGGCGGCACCGGCGG	21	0	0	NA	NA	NA	2	2	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA,NA|65aa|down_0|NC_020559.1_1093283_1093478_-	NA|120aa|up_9|NC_020559.1_1076827_1077187_+	cd10151, TthCsoR-like_DUF156, Thermus thermophilus CsoR, a Cu(I)-sensing transcriptional regulator, and related domains; this domain family was previously known as part of DUF156	NA|99aa|up_8|NC_020559.1_1077243_1077540_+	pfam07371, DUF1490, Protein of unknown function (DUF1490)	NA|771aa|up_7|NC_020559.1_1077595_1079908_+	cd02094, P-type_ATPase_Cu-like, P-type heavy metal-transporting ATPase, similar to human copper-transporting ATPases, ATP7A and ATP7B	NA|211aa|up_6|NC_020559.1_1079904_1080537_+	pfam17197, DUF5134, Domain of unknown function (DUF5134)	NA|270aa|up_5|NC_020559.1_1080627_1081437_-	PRK07827, PRK07827, enoyl-CoA hydratase family protein	NA|389aa|up_4|NC_020559.1_1081436_1082603_-	cd00567, ACAD, Acyl-CoA dehydrogenase	NA|668aa|up_3|NC_020559.1_1082599_1084603_-	COG4770, COG4770, Acetyl/propionyl-CoA carboxylase, alpha subunit [Lipid metabolism]	NA|530aa|up_2|NC_020559.1_1084608_1086198_-	COG4799, COG4799, Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) [Lipid metabolism]	NA|383aa|up_1|NC_020559.1_1086200_1087349_-	cd01160, LCAD, Long chain acyl-CoA dehydrogenase	NA|561aa|up_0|NC_020559.1_1087345_1089028_-	pfam07287, AtuA, Acyclic terpene utilisation family protein AtuA	NA|65aa|down_0|NC_020559.1_1093283_1093478_-	NA	NA|58aa|down_1|NC_020559.1_1093499_1093673_+	PRK01110, rpmF, 50S ribosomal protein L32; Validated	NA|186aa|down_2|NC_020559.1_1093691_1094249_-	COG3391, COG3391, Uncharacterized conserved protein [Function unknown]	NA|229aa|down_3|NC_020559.1_1095874_1096561_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|505aa|down_4|NC_020559.1_1096560_1098075_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|465aa|down_5|NC_020559.1_1098118_1099513_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|182aa|down_6|NC_020559.1_1099512_1100058_+	cd00886, MogA_MoaB, MogA_MoaB family	NA|152aa|down_7|NC_020559.1_1100077_1100533_-	PRK00567, mscL, large-conductance mechanosensitive channel protein MscL	NA|249aa|down_8|NC_020559.1_1100855_1101602_+	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|387aa|down_9|NC_020559.1_1104167_1105328_+	COG5621, COG5621, Predicted secreted hydrolase [General function prediction only]
GCF_000350205.1_ASM35020v1	NC_020559	Mycobacterium tuberculosis str. Erdman = ATCC 35801 chromosome 1, complete sequence	6	1094365-1094525	4	PILER-CR	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	CCGTCAGGGATGGAGCCGGTGACGGTGTTGGTG	33	0	0	NA	NA	NA	2	2	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|65aa|up_2|NC_020559.1_1093283_1093478_-,NA	NA|270aa|up_9|NC_020559.1_1080627_1081437_-	PRK07827, PRK07827, enoyl-CoA hydratase family protein	NA|389aa|up_8|NC_020559.1_1081436_1082603_-	cd00567, ACAD, Acyl-CoA dehydrogenase	NA|668aa|up_7|NC_020559.1_1082599_1084603_-	COG4770, COG4770, Acetyl/propionyl-CoA carboxylase, alpha subunit [Lipid metabolism]	NA|530aa|up_6|NC_020559.1_1084608_1086198_-	COG4799, COG4799, Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) [Lipid metabolism]	NA|383aa|up_5|NC_020559.1_1086200_1087349_-	cd01160, LCAD, Long chain acyl-CoA dehydrogenase	NA|561aa|up_4|NC_020559.1_1087345_1089028_-	pfam07287, AtuA, Acyclic terpene utilisation family protein AtuA	NA|659aa|up_3|NC_020559.1_1089225_1091202_+	pfam00934, PE, PE family	NA|65aa|up_2|NC_020559.1_1093283_1093478_-	NA	NA|58aa|up_1|NC_020559.1_1093499_1093673_+	PRK01110, rpmF, 50S ribosomal protein L32; Validated	NA|186aa|up_0|NC_020559.1_1093691_1094249_-	COG3391, COG3391, Uncharacterized conserved protein [Function unknown]	NA|229aa|down_0|NC_020559.1_1095874_1096561_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|505aa|down_1|NC_020559.1_1096560_1098075_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|465aa|down_2|NC_020559.1_1098118_1099513_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|182aa|down_3|NC_020559.1_1099512_1100058_+	cd00886, MogA_MoaB, MogA_MoaB family	NA|152aa|down_4|NC_020559.1_1100077_1100533_-	PRK00567, mscL, large-conductance mechanosensitive channel protein MscL	NA|249aa|down_5|NC_020559.1_1100855_1101602_+	cd03255, ABC_MJ0796_LolCDE_FtsE, ATP-binding cassette domain of the transporters involved in export of lipoprotein and macrolide, and cell division protein	NA|387aa|down_6|NC_020559.1_1104167_1105328_+	COG5621, COG5621, Predicted secreted hydrolase [General function prediction only]	NA|326aa|down_7|NC_020559.1_1105456_1106434_-	COG0142, IspA, Geranylgeranyl pyrophosphate synthase [Coenzyme metabolism]	NA|219aa|down_8|NC_020559.1_1106494_1107151_-	cd11614, SAF_CpaB_FlgA_like, SAF domains of the flagella basal body P-ring formation protein FlgA and the flp pilus assembly CpaB	NA|111aa|down_9|NC_020559.1_1107223_1107556_-	COG2331, COG2331, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_000350205.1_ASM35020v1	NC_020559	Mycobacterium tuberculosis str. Erdman = ATCC 35801 chromosome 1, complete sequence	7	1625752-1625836	3	CRISPRCasFinder	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	GCCGCCCTTACCGCCGGCGCCGCCAGC	27	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|38aa|up_6|NC_020559.1_1616134_1616248_-,NA|137aa|up_5|NC_020559.1_1616297_1616708_-,NA|288aa|down_3|NC_020559.1_1633455_1634319_+	NA|473aa|up_9|NC_020559.1_1611276_1612695_-	pfam00934, PE, PE family	NA|767aa|up_8|NC_020559.1_1612801_1615102_+	TIGR00509, Trimethylamine-N-oxide_reductase_1, molybdopterin guanine dinucleotide-containing S/N-oxide reductases	NA|162aa|up_7|NC_020559.1_1615217_1615703_-	cd07820, SRPBCC_3, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|38aa|up_6|NC_020559.1_1616134_1616248_-	NA	NA|137aa|up_5|NC_020559.1_1616297_1616708_-	NA	NA|248aa|up_4|NC_020559.1_1616724_1617468_-	pfam01182, Glucosamine_iso, Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase	NA|304aa|up_3|NC_020559.1_1617464_1618376_-	TIGR00534, Putative_OxPP_cycle_protein_OpcA, glucose-6-phosphate dehydrogenase assembly protein OpcA	NA|515aa|up_2|NC_020559.1_1618428_1619973_-	PRK05722, PRK05722, glucose-6-phosphate 1-dehydrogenase; Validated	NA|374aa|up_1|NC_020559.1_1619969_1621091_-	PRK03343, PRK03343, transaldolase; Validated	NA|701aa|up_0|NC_020559.1_1621107_1623210_-	COG0021, TktA, Transketolase [Carbohydrate transport and metabolism]	NA|309aa|down_0|NC_020559.1_1627817_1628744_+	PRK04375, PRK04375, protoheme IX farnesyltransferase; Provisional	NA|422aa|down_1|NC_020559.1_1631156_1632422_+	COG2508, COG2508, Regulator of polyketide synthase expression [Signal transduction mechanisms / Secondary metabolites biosynthesis, transport, and catabolism]	NA|329aa|down_2|NC_020559.1_1632449_1633436_-	cd05286, QOR2, Quinone oxidoreductase (QOR)	NA|288aa|down_3|NC_020559.1_1633455_1634319_+	NA	NA|311aa|down_4|NC_020559.1_1634268_1635201_-	COG1612, CtaA, Uncharacterized protein required for cytochrome oxidase assembly [Posttranslational modification, protein turnover, chaperones]	NA|262aa|down_5|NC_020559.1_1635312_1636098_-	TIGR00025, Mtu_efflux, ABC transporter efflux protein, DrrB family	NA|314aa|down_6|NC_020559.1_1636094_1637036_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|592aa|down_7|NC_020559.1_1637191_1638967_-	TIGR03459, crt_membr, carotene biosynthesis associated membrane protein	NA|269aa|down_8|NC_020559.1_1639014_1639821_+	COG2345, COG2345, Predicted transcriptional regulator [Transcription]	NA|847aa|down_9|NC_020559.1_1639817_1642358_+	TIGR01980, UPF0051_protein_slr0074, FeS assembly protein SufB
GCF_000350205.1_ASM35020v1	NC_020559	Mycobacterium tuberculosis str. Erdman = ATCC 35801 chromosome 1, complete sequence	8	1625920-1626022	4	CRISPRCasFinder	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	GCCGCCCTTACCGCCGGCGCCGCCAGC	27	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|38aa|up_6|NC_020559.1_1616134_1616248_-,NA|137aa|up_5|NC_020559.1_1616297_1616708_-,NA|288aa|down_3|NC_020559.1_1633455_1634319_+	NA|473aa|up_9|NC_020559.1_1611276_1612695_-	pfam00934, PE, PE family	NA|767aa|up_8|NC_020559.1_1612801_1615102_+	TIGR00509, Trimethylamine-N-oxide_reductase_1, molybdopterin guanine dinucleotide-containing S/N-oxide reductases	NA|162aa|up_7|NC_020559.1_1615217_1615703_-	cd07820, SRPBCC_3, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|38aa|up_6|NC_020559.1_1616134_1616248_-	NA	NA|137aa|up_5|NC_020559.1_1616297_1616708_-	NA	NA|248aa|up_4|NC_020559.1_1616724_1617468_-	pfam01182, Glucosamine_iso, Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase	NA|304aa|up_3|NC_020559.1_1617464_1618376_-	TIGR00534, Putative_OxPP_cycle_protein_OpcA, glucose-6-phosphate dehydrogenase assembly protein OpcA	NA|515aa|up_2|NC_020559.1_1618428_1619973_-	PRK05722, PRK05722, glucose-6-phosphate 1-dehydrogenase; Validated	NA|374aa|up_1|NC_020559.1_1619969_1621091_-	PRK03343, PRK03343, transaldolase; Validated	NA|701aa|up_0|NC_020559.1_1621107_1623210_-	COG0021, TktA, Transketolase [Carbohydrate transport and metabolism]	NA|309aa|down_0|NC_020559.1_1627817_1628744_+	PRK04375, PRK04375, protoheme IX farnesyltransferase; Provisional	NA|422aa|down_1|NC_020559.1_1631156_1632422_+	COG2508, COG2508, Regulator of polyketide synthase expression [Signal transduction mechanisms / Secondary metabolites biosynthesis, transport, and catabolism]	NA|329aa|down_2|NC_020559.1_1632449_1633436_-	cd05286, QOR2, Quinone oxidoreductase (QOR)	NA|288aa|down_3|NC_020559.1_1633455_1634319_+	NA	NA|311aa|down_4|NC_020559.1_1634268_1635201_-	COG1612, CtaA, Uncharacterized protein required for cytochrome oxidase assembly [Posttranslational modification, protein turnover, chaperones]	NA|262aa|down_5|NC_020559.1_1635312_1636098_-	TIGR00025, Mtu_efflux, ABC transporter efflux protein, DrrB family	NA|314aa|down_6|NC_020559.1_1636094_1637036_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|592aa|down_7|NC_020559.1_1637191_1638967_-	TIGR03459, crt_membr, carotene biosynthesis associated membrane protein	NA|269aa|down_8|NC_020559.1_1639014_1639821_+	COG2345, COG2345, Predicted transcriptional regulator [Transcription]	NA|847aa|down_9|NC_020559.1_1639817_1642358_+	TIGR01980, UPF0051_protein_slr0074, FeS assembly protein SufB
GCF_000350205.1_ASM35020v1	NC_020559	Mycobacterium tuberculosis str. Erdman = ATCC 35801 chromosome 1, complete sequence	9	1626133-1626474	5	CRISPRCasFinder	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	GCCGCCCTTACCGCCGGCGCCGCCAGC	27	1	7	1626376-1626393|1626376-1626393|1626376-1626393|1626376-1626393|1626376-1626393|1626376-1626393|1626376-1626393	NC_020559.1_1217165-1217148|NC_020559.1_1611814-1611831|NC_020559.1_1629522-1629539|NC_020559.1_1649124-1649141|NC_020559.1_1993584-1993567|NC_020559.1_2052403-2052420|NC_020559.1_3784397-3784380	NA	5	5	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|38aa|up_6|NC_020559.1_1616134_1616248_-,NA|137aa|up_5|NC_020559.1_1616297_1616708_-,NA|288aa|down_3|NC_020559.1_1633455_1634319_+	NA|473aa|up_9|NC_020559.1_1611276_1612695_-	pfam00934, PE, PE family	NA|767aa|up_8|NC_020559.1_1612801_1615102_+	TIGR00509, Trimethylamine-N-oxide_reductase_1, molybdopterin guanine dinucleotide-containing S/N-oxide reductases	NA|162aa|up_7|NC_020559.1_1615217_1615703_-	cd07820, SRPBCC_3, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|38aa|up_6|NC_020559.1_1616134_1616248_-	NA	NA|137aa|up_5|NC_020559.1_1616297_1616708_-	NA	NA|248aa|up_4|NC_020559.1_1616724_1617468_-	pfam01182, Glucosamine_iso, Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase	NA|304aa|up_3|NC_020559.1_1617464_1618376_-	TIGR00534, Putative_OxPP_cycle_protein_OpcA, glucose-6-phosphate dehydrogenase assembly protein OpcA	NA|515aa|up_2|NC_020559.1_1618428_1619973_-	PRK05722, PRK05722, glucose-6-phosphate 1-dehydrogenase; Validated	NA|374aa|up_1|NC_020559.1_1619969_1621091_-	PRK03343, PRK03343, transaldolase; Validated	NA|701aa|up_0|NC_020559.1_1621107_1623210_-	COG0021, TktA, Transketolase [Carbohydrate transport and metabolism]	NA|309aa|down_0|NC_020559.1_1627817_1628744_+	PRK04375, PRK04375, protoheme IX farnesyltransferase; Provisional	NA|422aa|down_1|NC_020559.1_1631156_1632422_+	COG2508, COG2508, Regulator of polyketide synthase expression [Signal transduction mechanisms / Secondary metabolites biosynthesis, transport, and catabolism]	NA|329aa|down_2|NC_020559.1_1632449_1633436_-	cd05286, QOR2, Quinone oxidoreductase (QOR)	NA|288aa|down_3|NC_020559.1_1633455_1634319_+	NA	NA|311aa|down_4|NC_020559.1_1634268_1635201_-	COG1612, CtaA, Uncharacterized protein required for cytochrome oxidase assembly [Posttranslational modification, protein turnover, chaperones]	NA|262aa|down_5|NC_020559.1_1635312_1636098_-	TIGR00025, Mtu_efflux, ABC transporter efflux protein, DrrB family	NA|314aa|down_6|NC_020559.1_1636094_1637036_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|592aa|down_7|NC_020559.1_1637191_1638967_-	TIGR03459, crt_membr, carotene biosynthesis associated membrane protein	NA|269aa|down_8|NC_020559.1_1639014_1639821_+	COG2345, COG2345, Predicted transcriptional regulator [Transcription]	NA|847aa|down_9|NC_020559.1_1639817_1642358_+	TIGR01980, UPF0051_protein_slr0074, FeS assembly protein SufB
GCF_000350205.1_ASM35020v1	NC_020559	Mycobacterium tuberculosis str. Erdman = ATCC 35801 chromosome 1, complete sequence	10	2052494-2053060	6	CRISPRCasFinder	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	CCGGCGCCGCCGGCCCCGCCGTTGCCGA	28	3	6	2052678-2052697|2052726-2052745|2052917-2052936|2052917-2052936|2052917-2052936|2052917-2052936	NC_020559.1_924871-924852|NC_020559.1_2450441-2450422|NC_020559.1_1489370-1489389|NC_020559.1_1847703-1847722|NC_020559.1_2079785-2079804|NC_020559.1_2932221-2932240	NA	10	10	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|222aa|up_2|NC_020559.1_2048730_2049396_+,NA	NA|404aa|up_9|NC_020559.1_2039589_2040801_+	pfam00823, PPE, PPE family	NA|410aa|up_8|NC_020559.1_2041124_2042354_+	pfam00823, PPE, PPE family	NA|469aa|up_7|NC_020559.1_2042485_2043892_+	pfam00823, PPE, PPE family	NA|119aa|up_6|NC_020559.1_2044136_2044493_+	pfam05305, DUF732, Protein of unknown function (DUF732)	NA|235aa|up_5|NC_020559.1_2044646_2045351_+	COG1285, SapB, Uncharacterized membrane protein [Function unknown]	NA|144aa|up_4|NC_020559.1_2046883_2047315_-	pfam13827, DUF4189, Domain of unknown function (DUF4189)	NA|301aa|up_3|NC_020559.1_2047723_2048626_+	COG3000, ERG3, Sterol desaturase [Lipid metabolism]	NA|222aa|up_2|NC_020559.1_2048730_2049396_+	NA	NA|235aa|up_1|NC_020559.1_2049458_2050163_+	pfam13305, WHG, WHG domain	NA|488aa|up_0|NC_020559.1_2050720_2052184_+	PRK07121, PRK07121, FAD-binding protein	NA|640aa|down_0|NC_020559.1_2053627_2055547_-	COG4178, COG4178, ABC-type uncharacterized transport system, permease and ATPase components [General function prediction only]	NA|548aa|down_1|NC_020559.1_2055617_2057261_+	PRK05858, PRK05858, acetolactate synthase	NA|809aa|down_2|NC_020559.1_2057275_2059702_+	PRK12326, PRK12326, preprotein translocase subunit SecA; Reviewed	NA|210aa|down_3|NC_020559.1_2059898_2060528_+	COG0558, PgsA, Phosphatidylglycerophosphate synthase [Lipid metabolism]	NA|308aa|down_4|NC_020559.1_2060520_2061444_+	COG3879, COG3879, Uncharacterized protein conserved in bacteria [Function unknown]	NA|122aa|down_5|NC_020559.1_2061472_2061838_+	COG3856, Sbp, Uncharacterized conserved protein (small basic protein) [Function unknown]	NA|293aa|down_6|NC_020559.1_2061854_2062733_+	COG3879, COG3879, Uncharacterized protein conserved in bacteria [Function unknown]	NA|135aa|down_7|NC_020559.1_2062770_2063175_+	PRK01202, PRK01202, glycine cleavage system protein GcvH	NA|163aa|down_8|NC_020559.1_2063414_2063903_+	COG1716, COG1716, FOG: FHA domain [Signal transduction mechanisms]	NA|248aa|down_9|NC_020559.1_2063899_2064643_+	cd00592, HTH_MerR-like, Helix-Turn-Helix DNA binding domain of MerR-like transcription regulators
GCF_000350205.1_ASM35020v1	NC_020559	Mycobacterium tuberculosis str. Erdman = ATCC 35801 chromosome 1, complete sequence	11	2079108-2079362	1	CRT	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	GCCNCCGTCGCCGCCNNTGCC	21	2	3	2079189-2079206|2079189-2079206|2079279-2079296	NC_020559.1_401448-401431|NC_020559.1_609032-609015|NC_020559.1_3436474-3436457	NA	5	5	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|88aa|up_0|NC_020559.1_2078467_2078731_-,NA	NA|165aa|up_9|NC_020559.1_2064761_2065256_+	pfam02577, DNase-RNase, Bifunctional nuclease	NA|226aa|up_8|NC_020559.1_2065659_2066337_+	cd01105, HTH_GlnR-like, Helix-Turn-Helix DNA binding domain of GlnR-like transcription regulators	NA|942aa|up_7|NC_020559.1_2066695_2069521_+	PRK05367, PRK05367, aminomethyl-transferring glycine dehydrogenase	NA|287aa|up_6|NC_020559.1_2069747_2070608_-	PRK03204, PRK03204, haloalkane dehalogenase; Provisional	NA|289aa|up_5|NC_020559.1_2070648_2071515_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|629aa|up_4|NC_020559.1_2071519_2073406_-	TIGR00976, Hypothetical_protein_Rv1835c/MT1883/Mb1866c	NA|678aa|up_3|NC_020559.1_2073421_2075455_-	cd01456, vWA_ywmD_type, VWA ywmD type:Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)	NA|742aa|up_2|NC_020559.1_2075574_2077800_-	PRK02999, PRK02999, malate synthase G; Provisional	NA|132aa|up_1|NC_020559.1_2078075_2078471_-	COG1848, COG1848, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|88aa|up_0|NC_020559.1_2078467_2078731_-	NA	NA|346aa|down_0|NC_020559.1_2080499_2081537_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|456aa|down_1|NC_020559.1_2081536_2082904_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|480aa|down_2|NC_020559.1_2083077_2084517_-	PRK07807, PRK07807, GuaB1 family IMP dehydrogenase-related protein	NA|484aa|down_3|NC_020559.1_2084549_2086001_-	PRK09287, PRK09287, NADP-dependent phosphogluconate dehydrogenase	NA|317aa|down_4|NC_020559.1_2086030_2086981_-	cd07326, M56_BlaR1_MecR1_like, Peptidase M56-like including those in BlaR1 and MecR1, integral membrane metallopeptidase	NA|139aa|down_5|NC_020559.1_2086995_2087412_-	COG3682, COG3682, Predicted transcriptional regulator [Transcription]	NA|141aa|down_6|NC_020559.1_2087689_2088112_+	COG2050, PaaI, HGG motif-containing thioesterase, possibly involved in aromatic compounds catabolism [Secondary metabolites biosynthesis,    transport, and catabolism]	NA|101aa|down_7|NC_020559.1_2088160_2088463_+	pfam00547, Urease_gamma, Urease, gamma subunit	NA|105aa|down_8|NC_020559.1_2088459_2088774_+	PRK13202, ureB, urease subunit beta; Reviewed	NA|578aa|down_9|NC_020559.1_2088773_2090507_+	PRK13206, ureC, urease subunit alpha; Reviewed
GCF_000350205.1_ASM35020v1	NC_020559	Mycobacterium tuberculosis str. Erdman = ATCC 35801 chromosome 1, complete sequence	12	2159909-2160128	7	CRISPRCasFinder	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	GTGCCAGCCGGAATCGTGATCGGCGGAACCGTCACCGACGGAATACTCA	49	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|157aa|up_9|NC_020559.1_2143304_2143775_-,NA|136aa|up_1|NC_020559.1_2150800_2151208_-,NA|127aa|down_5|NC_020559.1_2167644_2168025_-	NA|157aa|up_9|NC_020559.1_2143304_2143775_-	NA	NA|216aa|up_8|NC_020559.1_2144114_2144762_-	pfam14081, DUF4262, Domain of unknown function (DUF4262)	NA|741aa|up_7|NC_020559.1_2144768_2146991_-	PRK15061, PRK15061, catalase/peroxidase	NA|148aa|up_6|NC_020559.1_2147028_2147472_-	COG0735, Fur, Fe2+/Zn2+ uptake regulation proteins [Inorganic ion transport and metabolism]	NA|198aa|up_5|NC_020559.1_2147585_2148179_-	COG1881, COG1881, Phospholipid-binding protein [General function prediction only]	NA|202aa|up_4|NC_020559.1_2148261_2148867_-	COG1881, COG1881, Phospholipid-binding protein [General function prediction only]	NA|335aa|up_3|NC_020559.1_2148966_2149971_-	cd08275, MDR3, Medium chain dehydrogenases/reductase (MDR)/zinc-dependent alcohol dehydrogenase-like family	NA|251aa|up_2|NC_020559.1_2150070_2150823_+	cd16282, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	NA|136aa|up_1|NC_020559.1_2150800_2151208_-	NA	NA|767aa|up_0|NC_020559.1_2151342_2153643_+	PLN02892, PLN02892, isocitrate lyase	NA|155aa|down_0|NC_020559.1_2162155_2162620_-	cd07821, PYR_PYL_RCAR_like, Pyrabactin resistance 1 (PYR1), PYR1-like (PYL), regulatory component of abscisic acid receptors (RCARs), and related proteins	NA|288aa|down_1|NC_020559.1_2162717_2163581_+	cd07987, LPLAT_MGAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: MGAT-like	NA|424aa|down_2|NC_020559.1_2163618_2164890_-	TIGR01490, Uncharacterized_protein_Rv3661/MT3761, HAD-superfamily subfamily IB hydrolase, TIGR01490	NA|372aa|down_3|NC_020559.1_2165161_2166277_+	COG1680, AmpC, Beta-lactamase class C and other penicillin binding proteins [Defense mechanisms]	NA|447aa|down_4|NC_020559.1_2166267_2167608_+	pfam00144, Beta-lactamase, Beta-lactamase	NA|127aa|down_5|NC_020559.1_2167644_2168025_-	NA	NA|621aa|down_6|NC_020559.1_2168181_2170044_+	PRK12476, PRK12476, putative fatty-acid--CoA ligase; Provisional	NA|160aa|down_7|NC_020559.1_2170051_2170531_-	pfam09167, DUF1942, Domain of unknown function (DUF1942)	NA|258aa|down_8|NC_020559.1_2170767_2171541_+	COG3361, COG3361, Uncharacterized conserved protein [Function unknown]	NA|215aa|down_9|NC_020559.1_2172334_2172979_-	TIGR03085, TIGR03085, TIGR03085 family protein
GCF_000350205.1_ASM35020v1	NC_020559	Mycobacterium tuberculosis str. Erdman = ATCC 35801 chromosome 1, complete sequence	13	3105878-3106794	5,8,2	PILER-CR,CRISPRCasFinder,CRT	no	c2c9_V-U4,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Type III-B,Type III-D,Type III-C,Type III-A	GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC,GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC,GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC	36,36,36	0	0	NA	NA	II-B,III-A:II-B,III-A:II-B,III-A	11,12,12	12	TypeIII-B,TypeIII-D,TypeIII-C,TypeIII-A	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|85aa|up_8|NC_020559.1_3099157_3099412_-,NA|135aa|up_7|NC_020559.1_3099559_3099964_+,NA|64aa|up_6|NC_020559.1_3099960_3100152_+,NA|86aa|up_4|NC_020559.1_3101738_3101996_+,NA|104aa|up_3|NC_020559.1_3102100_3102412_+,NA|203aa|up_2|NC_020559.1_3102831_3103440_+,NA	NA|92aa|up_9|NC_020559.1_3098706_3098982_+	COG4453, COG4453, Uncharacterized protein conserved in bacteria [Function unknown]	NA|85aa|up_8|NC_020559.1_3099157_3099412_-	NA	NA|135aa|up_7|NC_020559.1_3099559_3099964_+	NA	NA|64aa|up_6|NC_020559.1_3099960_3100152_+	NA	NA|385aa|up_5|NC_020559.1_3100350_3101505_+	pfam00665, rve, Integrase core domain	NA|86aa|up_4|NC_020559.1_3101738_3101996_+	NA	NA|104aa|up_3|NC_020559.1_3102100_3102412_+	NA	NA|203aa|up_2|NC_020559.1_3102831_3103440_+	NA	NA|470aa|up_1|NC_020559.1_3103510_3104920_+	pfam00665, rve, Integrase core domain	NA|271aa|up_0|NC_020559.1_3104916_3105729_+	COG3267, ExeA, Type II secretory pathway, component ExeA (predicted ATPase) [Intracellular trafficking and secretion]	NA|421aa|down_0|NC_020559.1_3106833_3108095_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	cas2|114aa|down_1|NC_020559.1_3110046_3110388_-	COG1343, COG1343, CRISPR-associated protein Cas2 [Defense mechanisms]	cas1|339aa|down_2|NC_020559.1_3110388_3111405_-	TIGR00287, CRISPR-associated_endonuclease_Cas1, CRISPR-associated endonuclease Cas1	NA|421aa|down_3|NC_020559.1_3111661_3112923_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|421aa|down_4|NC_020559.1_3113615_3114877_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	csm4gr5|303aa|down_5|NC_020559.1_3115763_3116672_-	COG1567, COG1567, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csm3gr7|237aa|down_6|NC_020559.1_3116652_3117363_-	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm2gr11|125aa|down_7|NC_020559.1_3117372_3117747_-	TIGR01870, CRISPR_type_III-associated_protein_Csm2, CRISPR type III-A/MTUBE-associated protein Csm2	cas10|813aa|down_8|NC_020559.1_3117743_3120182_-	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	cas6|241aa|down_9|NC_020559.1_3120178_3120901_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]
GCF_000350205.1_ASM35020v1	NC_020559	Mycobacterium tuberculosis str. Erdman = ATCC 35801 chromosome 1, complete sequence	14	3108130-3109998	9,3,6	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Type III-B,Type III-D,Type III-C,Type III-A	GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC,GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC,GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC	36,36,36	0	0	NA	NA	II-B,III-A:II-B,III-A:II-B,III-A	25,25,24	25	TypeIII-B,TypeIII-D,TypeIII-C,TypeIII-A	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|85aa|up_9|NC_020559.1_3099157_3099412_-,NA|135aa|up_8|NC_020559.1_3099559_3099964_+,NA|64aa|up_7|NC_020559.1_3099960_3100152_+,NA|86aa|up_5|NC_020559.1_3101738_3101996_+,NA|104aa|up_4|NC_020559.1_3102100_3102412_+,NA|203aa|up_3|NC_020559.1_3102831_3103440_+,NA	NA|85aa|up_9|NC_020559.1_3099157_3099412_-	NA	NA|135aa|up_8|NC_020559.1_3099559_3099964_+	NA	NA|64aa|up_7|NC_020559.1_3099960_3100152_+	NA	NA|385aa|up_6|NC_020559.1_3100350_3101505_+	pfam00665, rve, Integrase core domain	NA|86aa|up_5|NC_020559.1_3101738_3101996_+	NA	NA|104aa|up_4|NC_020559.1_3102100_3102412_+	NA	NA|203aa|up_3|NC_020559.1_3102831_3103440_+	NA	NA|470aa|up_2|NC_020559.1_3103510_3104920_+	pfam00665, rve, Integrase core domain	NA|271aa|up_1|NC_020559.1_3104916_3105729_+	COG3267, ExeA, Type II secretory pathway, component ExeA (predicted ATPase) [Intracellular trafficking and secretion]	NA|421aa|up_0|NC_020559.1_3106833_3108095_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	cas2|114aa|down_0|NC_020559.1_3110046_3110388_-	COG1343, COG1343, CRISPR-associated protein Cas2 [Defense mechanisms]	cas1|339aa|down_1|NC_020559.1_3110388_3111405_-	TIGR00287, CRISPR-associated_endonuclease_Cas1, CRISPR-associated endonuclease Cas1	NA|421aa|down_2|NC_020559.1_3111661_3112923_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|421aa|down_3|NC_020559.1_3113615_3114877_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	csm4gr5|303aa|down_4|NC_020559.1_3115763_3116672_-	COG1567, COG1567, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csm3gr7|237aa|down_5|NC_020559.1_3116652_3117363_-	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm2gr11|125aa|down_6|NC_020559.1_3117372_3117747_-	TIGR01870, CRISPR_type_III-associated_protein_Csm2, CRISPR type III-A/MTUBE-associated protein Csm2	cas10|813aa|down_7|NC_020559.1_3117743_3120182_-	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	cas6|241aa|down_8|NC_020559.1_3120178_3120901_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|182aa|down_9|NC_020559.1_3121300_3121846_-	COG4293, COG4293, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_000350205.1_ASM35020v1	NC_020559	Mycobacterium tuberculosis str. Erdman = ATCC 35801 chromosome 1, complete sequence	15	3723832-3724373	4	CRT	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	CCGCCGTTNCCNCCGTTGCCGCC	23	0	0	NA	NA	NA	7	7	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|94aa|up_0|NC_020559.1_3721135_3721417_+,NA|96aa|down_4|NC_020559.1_3738500_3738788_-,NA|87aa|down_8|NC_020559.1_3751896_3752157_-	NA|290aa|up_9|NC_020559.1_3706635_3707505_-	TIGR00766, Uncharacterized_protein_Dda3937_02003, inner membrane protein YhjD	NA|337aa|up_8|NC_020559.1_3707525_3708536_-	PRK00927, PRK00927, tryptophanyl-tRNA synthetase; Reviewed	NA|215aa|up_7|NC_020559.1_3708808_3709453_+	PRK14875, PRK14875, acetoin dehydrogenase E2 subunit dihydrolipoyllysine-residue acetyltransferase; Provisional	NA|410aa|up_6|NC_020559.1_3709519_3710749_-	PRK08299, PRK08299, NADP-dependent isocitrate dehydrogenase	NA|450aa|up_5|NC_020559.1_3711031_3712381_+	PRK07812, PRK07812, O-acetylhomoserine aminocarboxypropyltransferase; Validated	NA|380aa|up_4|NC_020559.1_3712392_3713532_+	PRK00175, metX, homoserine O-acetyltransferase; Provisional	NA|244aa|up_3|NC_020559.1_3713528_3714260_+	pfam08241, Methyltransf_11, Methyltransferase domain	NA|1948aa|up_2|NC_020559.1_3714268_3720112_-	pfam00823, PPE, PPE family	NA|339aa|up_1|NC_020559.1_3720160_3721177_-	pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex non-fungal	NA|94aa|up_0|NC_020559.1_3721135_3721417_+	NA	NA|86aa|down_0|NC_020559.1_3726370_3726628_-	pfam11222, DUF3017, Protein of unknown function (DUF3017)	NA|3158aa|down_1|NC_020559.1_3726883_3736357_-	pfam00823, PPE, PPE family	NA|149aa|down_2|NC_020559.1_3736982_3737429_+	pfam01548, DEDD_Tnp_IS110, Transposase	NA|247aa|down_3|NC_020559.1_3737465_3738206_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|96aa|down_4|NC_020559.1_3738500_3738788_-	NA	NA|265aa|down_5|NC_020559.1_3750506_3751301_-	pfam08031, BBE, Berberine and berberine like	NA|124aa|down_6|NC_020559.1_3751382_3751754_-	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|73aa|down_7|NC_020559.1_3751651_3751870_-	pfam01565, FAD_binding_4, FAD binding domain	NA|87aa|down_8|NC_020559.1_3751896_3752157_-	NA	NA|130aa|down_9|NC_020559.1_3752271_3752661_+	pfam05305, DUF732, Protein of unknown function (DUF732)
GCF_000350205.1_ASM35020v1	NC_020559	Mycobacterium tuberculosis str. Erdman = ATCC 35801 chromosome 1, complete sequence	16	3829797-3829886	10	CRISPRCasFinder	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	CCAGGCGTTGGGCTGGCTGCCGAT	24	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|77aa|up_7|NC_020559.1_3823191_3823422_+,NA|121aa|up_6|NC_020559.1_3823525_3823888_-,NA|73aa|up_2|NC_020559.1_3827862_3828081_-,NA|121aa|down_0|NC_020559.1_3830871_3831234_-,NA|52aa|down_2|NC_020559.1_3833075_3833231_-,NA|52aa|down_3|NC_020559.1_3833255_3833411_-,NA|238aa|down_6|NC_020559.1_3837446_3838160_-	NA|169aa|up_9|NC_020559.1_3821502_3822009_-	COG0802, COG0802, Predicted ATPase or kinase [General function prediction only]	NA|409aa|up_8|NC_020559.1_3822005_3823232_-	PRK00053, alr, alanine racemase; Reviewed	NA|77aa|up_7|NC_020559.1_3823191_3823422_+	NA	NA|121aa|up_6|NC_020559.1_3823525_3823888_-	NA	NA|177aa|up_5|NC_020559.1_3824050_3824581_+	COG5651, COG5651, PPE-repeat proteins [Cell motility and secretion]	NA|177aa|up_4|NC_020559.1_3824847_3825378_+	pfam00823, PPE, PPE family	NA|725aa|up_3|NC_020559.1_3825695_3827870_-	COG1484, DnaC, DNA replication protein [DNA replication, recombination, and repair]	NA|73aa|up_2|NC_020559.1_3827862_3828081_-	NA	NA|100aa|up_1|NC_020559.1_3828575_3828875_+	pfam00934, PE, PE family	NA|181aa|up_0|NC_020559.1_3828960_3829503_+	COG5651, COG5651, PPE-repeat proteins [Cell motility and secretion]	NA|121aa|down_0|NC_020559.1_3830871_3831234_-	NA	NA|179aa|down_1|NC_020559.1_3831396_3831933_+	pfam00823, PPE, PPE family	NA|52aa|down_2|NC_020559.1_3833075_3833231_-	NA	NA|52aa|down_3|NC_020559.1_3833255_3833411_-	NA	NA|461aa|down_4|NC_020559.1_3834603_3835986_-	TIGR01788, Glutamate_decarboxylase_alpha_GAD-alpha	NA|474aa|down_5|NC_020559.1_3836023_3837445_-	pfam01256, Carb_kinase, Carbohydrate kinase	NA|238aa|down_6|NC_020559.1_3837446_3838160_-	NA	NA|285aa|down_7|NC_020559.1_3838170_3839025_-	pfam14494, DUF4436, Domain of unknown function (DUF4436)	NA|625aa|down_8|NC_020559.1_3839246_3841121_-	PRK00331, PRK00331, isomerizing glutamine--fructose-6-phosphate transaminase	NA|159aa|down_9|NC_020559.1_3841142_3841619_+	pfam10708, DUF2510, Protein of unknown function (DUF2510)
GCF_000350205.1_ASM35020v1	NC_020559	Mycobacterium tuberculosis str. Erdman = ATCC 35801 chromosome 1, complete sequence	17	4091415-4091503	11	CRISPRCasFinder	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	GCTCGGCGACGATGCGGGCCGGATGACGGCC	31	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|257aa|up_6|NC_020559.1_4082001_4082772_-,NA|233aa|up_0|NC_020559.1_4090519_4091218_-,NA|126aa|down_6|NC_020559.1_4096738_4097116_+	NA|388aa|up_9|NC_020559.1_4077672_4078836_-	TIGR03819, heli_sec_ATPase, helicase/secretion neighborhood ATPase	NA|351aa|up_8|NC_020559.1_4078832_4079885_-	TIGR03815, CpaE_hom_Actino, helicase/secretion neighborhood CpaE-like protein	NA|288aa|up_7|NC_020559.1_4080383_4081247_+	TIGR01490, Uncharacterized_protein_Rv3661/MT3761, HAD-superfamily subfamily IB hydrolase, TIGR01490	NA|257aa|up_6|NC_020559.1_4082001_4082772_-	NA	NA|549aa|up_5|NC_020559.1_4082768_4084415_-	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]	NA|288aa|up_4|NC_020559.1_4084411_4085275_-	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|309aa|up_3|NC_020559.1_4085267_4086194_-	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|542aa|up_2|NC_020559.1_4086195_4087821_-	cd00995, PBP2_NikA_DppA_OppA_like, The substrate-binding domain of an ABC-type nickel/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|652aa|up_1|NC_020559.1_4088528_4090484_+	PRK00174, PRK00174, acetyl-CoA synthetase; Provisional	NA|233aa|up_0|NC_020559.1_4090519_4091218_-	NA	NA|173aa|down_0|NC_020559.1_4091563_4092082_+	pfam07332, Phage_holin_3_6, Putative Actinobacterial Holin-X, holin superfamily III	NA|328aa|down_1|NC_020559.1_4092082_4093066_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|398aa|down_2|NC_020559.1_4093058_4094252_-	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|274aa|down_3|NC_020559.1_4094257_4095079_-	cd03426, CoAse, Coenzyme A pyrophosphatase (CoAse), a member of the Nudix hydrolase superfamily, functions to catalyze the elimination of oxidized inactive CoA, which can inhibit CoA-utilizing enzymes	NA|228aa|down_4|NC_020559.1_4095210_4095894_-	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|246aa|down_5|NC_020559.1_4095893_4096631_-	COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]	NA|126aa|down_6|NC_020559.1_4096738_4097116_+	NA	NA|225aa|down_7|NC_020559.1_4097214_4097889_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|265aa|down_8|NC_020559.1_4097992_4098787_-	cd16278, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	NA|152aa|down_9|NC_020559.1_4098793_4099249_-	cd02199, YjgF_YER057c_UK114_like_1, This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function
