assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002357935.1_ASM235793v1	NZ_AP018034	Mycobacterium tuberculosis strain HN-205	1	330767-330877	1	PILER-CR	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Orphan	CCGCCGTTGCCGCCGTTGCCGATC	24	0	0	NA	NA	NA	2	2	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA|61aa|up_0|NZ_AP018034.1_329099_329282_-,NA	NA|398aa|up_9|NZ_AP018034.1_319277_320471_-	COG3285, COG3285, Predicted eukaryotic-type DNA primase [DNA replication, recombination, and repair]	NA|561aa|up_8|NZ_AP018034.1_320506_322189_+	PRK07788, PRK07788, acyl-CoA synthetase; Validated	NA|732aa|up_7|NZ_AP018034.1_322205_324401_-	cd01152, ACAD_fadE6_17_26, Putative acyl-CoA dehydrogenases similar to fadE6, fadE17, and fadE26	NA|378aa|up_6|NZ_AP018034.1_324514_325648_-	pfam12146, Hydrolase_4, Serine aminopeptidase, S33	NA|207aa|up_5|NZ_AP018034.1_325644_326265_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|194aa|up_4|NZ_AP018034.1_326361_326943_+	pfam00903, Glyoxalase, Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily	NA|242aa|up_3|NZ_AP018034.1_326872_327598_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|307aa|up_2|NZ_AP018034.1_327687_328608_+	COG3662, COG3662, Uncharacterized protein conserved in bacteria [Function unknown]	NA|143aa|up_1|NZ_AP018034.1_328647_329076_-	cd18678, PIN_MtVapC25_VapC33-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC25, VapC33, and related proteins	NA|61aa|up_0|NZ_AP018034.1_329099_329282_-	NA	NA|903aa|down_0|NZ_AP018034.1_332447_335156_-	pfam00934, PE, PE family	NA|537aa|down_1|NZ_AP018034.1_335446_337057_+	COG5651, COG5651, PPE-repeat proteins [Cell motility and secretion]	NA|303aa|down_2|NZ_AP018034.1_337080_337989_+	COG3315, COG3315, O-Methyltransferase involved in polyketide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism]	NA|632aa|down_3|NZ_AP018034.1_338212_340108_+	TIGR03922, T7SS_EccA, type VII secretion AAA-ATPase EccA	NA|539aa|down_4|NZ_AP018034.1_340104_341721_+	pfam05108, T7SS_ESX1_EccB, Type VII secretion system ESX-1, transport TM domain B	NA|1331aa|down_5|NZ_AP018034.1_341717_345710_+	TIGR03924, T7SS_EccC_a, type VII secretion protein EccCa	NA|103aa|down_6|NZ_AP018034.1_345706_346015_+	pfam00934, PE, PE family	NA|514aa|down_7|NZ_AP018034.1_346017_347559_+	COG5651, COG5651, PPE-repeat proteins [Cell motility and secretion]	NA|98aa|down_8|NZ_AP018034.1_347607_347901_+	TIGR03930, WXG100_ESAT6, WXG100 family type VII secretion target	NA|97aa|down_9|NZ_AP018034.1_347930_348221_+	COG4842, COG4842, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_002357935.1_ASM235793v1	NZ_AP018034	Mycobacterium tuberculosis strain HN-205	2	362546-363244	1	CRISPRCasFinder	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Orphan	TTCGCGAAGCCGATGTTGTAGCTGCCGGTGTTG	33	2	2	362834-362875|362909-362941	NZ_AP018034.1_370850-370891|NZ_AP018034.1_369665-369697	NA	10	10	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA|101aa|up_4|NZ_AP018034.1_359558_359861_+,NA|161aa|down_1|NZ_AP018034.1_372639_373122_-,NA|410aa|down_5|NZ_AP018034.1_375238_376468_+	NA|262aa|up_9|NZ_AP018034.1_354253_355039_+	PRK14103, PRK14103, trans-aconitate 2-methyltransferase; Provisional	NA|268aa|up_8|NZ_AP018034.1_355027_355831_-	COG4424, COG4424, Uncharacterized protein conserved in bacteria [Function unknown]	NA|466aa|up_7|NZ_AP018034.1_355840_357238_-	cd16027, SGSH, N-sulfoglucosamine sulfohydrolase (SGSH; sulfamidase)	NA|592aa|up_6|NZ_AP018034.1_357416_359192_+	pfam00934, PE, PE family	NA|76aa|up_5|NZ_AP018034.1_359334_359562_+	pfam01402, RHH_1, Ribbon-helix-helix protein, copG family	NA|101aa|up_4|NZ_AP018034.1_359558_359861_+	NA	NA|74aa|up_3|NZ_AP018034.1_359908_360130_+	PHA01748, PHA01748, hypothetical protein	NA|142aa|up_2|NZ_AP018034.1_360126_360552_+	cd18755, PIN_MtVapC3_VapC21-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC3, VapC21 and related proteins	NA|211aa|up_1|NZ_AP018034.1_360687_361320_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|303aa|up_0|NZ_AP018034.1_361316_362225_+	cd09810, LPOR_like_SDR_c_like, light-dependent protochlorophyllide reductase (LPOR)-like, classical (c)-like SDRs	NA|224aa|down_0|NZ_AP018034.1_371980_372652_+	TIGR02476, BluB, 5,6-dimethylbenzimidazole synthase	NA|161aa|down_1|NZ_AP018034.1_372639_373122_-	NA	NA|239aa|down_2|NZ_AP018034.1_373179_373896_+	cd03392, PAP2_like_2, PAP2_like_2 proteins	NA|219aa|down_3|NZ_AP018034.1_373997_374654_+	COG3786, COG3786, Uncharacterized protein conserved in bacteria [Function unknown]	NA|164aa|down_4|NZ_AP018034.1_374723_375215_-	pfam13577, SnoaL_4, SnoaL-like domain	NA|410aa|down_5|NZ_AP018034.1_375238_376468_+	NA	NA|621aa|down_6|NZ_AP018034.1_376622_378485_+	COG0443, DnaK, Molecular chaperone [Posttranslational modification, protein turnover, chaperones]	NA|129aa|down_7|NZ_AP018034.1_378556_378943_+	COG0326, HtpG, Molecular chaperone, HSP90 family [Posttranslational modification, protein turnover, chaperones]	NA|212aa|down_8|NZ_AP018034.1_378945_379581_-	pfam11259, DUF3060, Protein of unknown function (DUF3060)	NA|295aa|down_9|NZ_AP018034.1_379668_380553_+	COG2273, SKN1, Beta-glucanase/Beta-glucan synthetase [Carbohydrate transport and metabolism]
GCF_002357935.1_ASM235793v1	NZ_AP018034	Mycobacterium tuberculosis strain HN-205	3	688079-688155	2	CRISPRCasFinder	no	c2c9_V-U4	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Type V-U4	TGAGGTGCGGCGTGAGCGCGGGT	23	0	0	NA	NA	NA	1	1	TypeV-U4	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA,NA	NA|136aa|up_9|NZ_AP018034.1_673974_674382_+	cd18696, PIN_MtVapC26-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC26 and related proteins	NA|229aa|up_8|NZ_AP018034.1_674441_675128_-	pfam10738, Lpp-LpqN, Probable lipoprotein LpqN	NA|878aa|up_7|NZ_AP018034.1_675281_677915_+	COG3537, COG3537, Putative alpha-1,2-mannosidase [Carbohydrate transport and metabolism]	NA|796aa|up_6|NZ_AP018034.1_677937_680325_-	pfam03706, LPG_synthase_TM, Lysylphosphatidylglycerol synthase TM region	NA|241aa|up_5|NZ_AP018034.1_680462_681185_+	COG2186, FadR, Transcriptional regulators [Transcription]	NA|266aa|up_4|NZ_AP018034.1_681181_681979_+	COG0767, Ttg2B, ABC-type transport system involved in resistance to organic solvents, permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|296aa|up_3|NZ_AP018034.1_681980_682868_+	COG0767, Ttg2B, ABC-type transport system involved in resistance to organic solvents, permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|405aa|up_2|NZ_AP018034.1_682873_684088_+	pfam11887, Mce4_CUP1, Cholesterol uptake porter CUP1 of Mce4, putative	NA|344aa|up_1|NZ_AP018034.1_684084_685116_+	COG1463, Ttg2C, ABC-type transport system involved in resistance to organic solvents, periplasmic component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|482aa|up_0|NZ_AP018034.1_685112_686558_+	TIGR00996, Mtu_fam_mce, virulence factor Mce family protein	NA|517aa|down_0|NZ_AP018034.1_689290_690841_+	COG1463, Ttg2C, ABC-type transport system involved in resistance to organic solvents, periplasmic component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|131aa|down_1|NZ_AP018034.1_690892_691285_-	cd18768, PIN_MtVapC4-C5-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC4, VapC5, and related proteins	NA|86aa|down_2|NZ_AP018034.1_691281_691539_-	COG4118, Phd, Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]	NA|412aa|down_3|NZ_AP018034.1_691721_692957_-	COG1373, COG1373, Predicted ATPase (AAA+ superfamily) [General function prediction only]	NA|138aa|down_4|NZ_AP018034.1_693207_693621_-	cd18681, PIN_MtVapC27-VapC40_like, VapC-like PIN domain of Mycobacterium tuberculosis VapC27, and VapC40, and related proteins	NA|79aa|down_5|NZ_AP018034.1_693617_693854_-	COG2002, AbrB, Regulators of stationary/sporulation gene expression [Transcription]	NA|169aa|down_6|NZ_AP018034.1_693957_694464_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|157aa|down_7|NZ_AP018034.1_694577_695048_-	PRK10755, PRK10755, two-component system sensor histidine kinase PmrB	NA|254aa|down_8|NZ_AP018034.1_695091_695853_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|104aa|down_9|NZ_AP018034.1_695909_696221_+	pfam03413, PepSY, Peptidase propeptide and YPEB domain
GCF_002357935.1_ASM235793v1	NZ_AP018034	Mycobacterium tuberculosis strain HN-205	4	958452-958703	2	PILER-CR	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Orphan	CGGGTGGTGCCAGGTCGGCGGGCGCGGGTGGCGCCAGTTCGGCGGGCGCGGGTGGCGCCAGGT	63	1	1	958625-958680	NZ_AP018034.1_958753-958808	NA	2	2	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA|67aa|up_3|NZ_AP018034.1_956550_956751_+,NA	NA|561aa|up_9|NZ_AP018034.1_945707_947390_-	COG3961, COG3961, Pyruvate decarboxylase and related thiamine pyrophosphate-requiring enzymes [Carbohydrate transport and metabolism / Coenzyme metabolism / General function prediction only]	NA|148aa|up_8|NZ_AP018034.1_947454_947898_+	cd07819, SRPBCC_2, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|404aa|up_7|NZ_AP018034.1_948947_950159_+	PRK08242, PRK08242, acetyl-CoA C-acetyltransferase	NA|721aa|up_6|NZ_AP018034.1_950163_952326_+	PRK11730, fadB, fatty acid oxidation complex subunit alpha FadB	NA|543aa|up_5|NZ_AP018034.1_952393_954022_-	COG1061, SSL2, DNA or RNA helicases of superfamily II [Transcription / DNA replication, recombination, and repair]	NA|757aa|up_4|NZ_AP018034.1_954212_956483_-	pfam13625, Helicase_C_3, Helicase conserved C-terminal domain	NA|67aa|up_3|NZ_AP018034.1_956550_956751_+	NA	NA|168aa|up_2|NZ_AP018034.1_956760_957264_+	PRK09364, moaC, cyclic pyranopterin monophosphate synthase MoaC	NA|161aa|up_1|NZ_AP018034.1_957260_957743_+	cd00886, MogA_MoaB, MogA_MoaB family	NA|142aa|up_0|NZ_AP018034.1_957739_958165_+	COG0314, MoaE, Molybdopterin converting factor, large subunit [Coenzyme metabolism]	NA|93aa|down_0|NZ_AP018034.1_959853_960132_-	cd00754, Ubl_MoaD, ubiquitin-like (Ubl) domain found in molybdenum cofactor biosynthesis protein D (MoaD) and similar proteins	NA|361aa|down_1|NZ_AP018034.1_960135_961218_-	PRK00164, moaA, GTP 3',8-cyclase MoaA	NA|130aa|down_2|NZ_AP018034.1_961214_961604_-	PRK11770, PRK11770, YccF domain-containing protein	NA|136aa|down_3|NZ_AP018034.1_961768_962176_+	COG1278, CspC, Cold shock proteins [Transcription]	NA|610aa|down_4|NZ_AP018034.1_962294_964124_-	pfam00934, PE, PE family	NA|651aa|down_5|NZ_AP018034.1_964384_966337_+	cd01161, VLCAD, Very long chain acyl-CoA dehydrogenase	NA|387aa|down_6|NZ_AP018034.1_966425_967586_-	COG4398, COG4398, Uncharacterized protein conserved in bacteria [Function unknown]	NA|163aa|down_7|NZ_AP018034.1_967685_968174_-	pfam10969, DUF2771, Protein of unknown function (DUF2771)	NA|549aa|down_8|NZ_AP018034.1_968170_969817_-	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|263aa|down_9|NZ_AP018034.1_969954_970743_+	pfam11228, DUF3027, Protein of unknown function (DUF3027)
GCF_002357935.1_ASM235793v1	NZ_AP018034	Mycobacterium tuberculosis strain HN-205	5	1568948-1570020	3	CRISPRCasFinder	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Orphan	CCGCCGTTGCCGCCGGCACCGCCGTCGCCGAT	32	1	1	1569148-1569169	NZ_AP018034.1_1206120-1206099	NA	15	15	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA|191aa|up_7|NZ_AP018034.1_1560456_1561029_+,NA	NA|103aa|up_9|NZ_AP018034.1_1558226_1558535_+	pfam00934, PE, PE family	NA|540aa|up_8|NZ_AP018034.1_1558531_1560151_+	COG5651, COG5651, PPE-repeat proteins [Cell motility and secretion]	NA|191aa|up_7|NZ_AP018034.1_1560456_1561029_+	NA	NA|209aa|up_6|NZ_AP018034.1_1561163_1561790_+	PRK00300, gmk, guanylate kinase; Provisional	NA|111aa|up_5|NZ_AP018034.1_1561855_1562188_+	TIGR00690, DNA-directed_RNA_polymerase_subunit_omega, DNA-directed RNA polymerase, omega subunit	NA|419aa|up_4|NZ_AP018034.1_1562203_1563460_+	PRK05579, PRK05579, bifunctional phosphopantothenoylcysteine decarboxylase/phosphopantothenate synthase; Validated	NA|404aa|up_3|NZ_AP018034.1_1563587_1564799_+	PRK05250, PRK05250, S-adenosylmethionine synthetase; Validated	NA|493aa|up_2|NZ_AP018034.1_1564871_1566350_-	COG2072, TrkA, Predicted flavoprotein involved in K+ transport [Inorganic ion transport and metabolism]	NA|462aa|up_1|NZ_AP018034.1_1566346_1567732_-	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|345aa|up_0|NZ_AP018034.1_1567809_1568844_+	smart00342, HTH_ARAC, helix_turn_helix, arabinose operon control protein	NA|134aa|down_0|NZ_AP018034.1_1570874_1571276_-	cd18741, PIN_VapC4-5_FitB-like, uncharacterized subgroup of the PIN_VapC4-5_FitB-like subfamily of the PIN domain superfamily	NA|86aa|down_1|NZ_AP018034.1_1571272_1571530_-	pfam01402, RHH_1, Ribbon-helix-helix protein, copG family	NA|320aa|down_2|NZ_AP018034.1_1571612_1572572_-	pfam07859, Abhydrolase_3, alpha/beta hydrolase fold	NA|321aa|down_3|NZ_AP018034.1_1572596_1573559_-	pfam07859, Abhydrolase_3, alpha/beta hydrolase fold	NA|201aa|down_4|NZ_AP018034.1_1573692_1574295_+	COG3714, COG3714, Predicted membrane protein [Function unknown]	NA|656aa|down_5|NZ_AP018034.1_1574375_1576343_+	PRK14873, PRK14873, primosomal protein N'	NA|275aa|down_6|NZ_AP018034.1_1576360_1577185_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|161aa|down_7|NZ_AP018034.1_1577353_1577836_+	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|275aa|down_8|NZ_AP018034.1_1577907_1578732_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|313aa|down_9|NZ_AP018034.1_1578928_1579867_+	PRK00005, fmt, methionyl-tRNA formyltransferase; Reviewed
GCF_002357935.1_ASM235793v1	NZ_AP018034	Mycobacterium tuberculosis strain HN-205	6	2067558-2067812	1	CRT	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Orphan	GCCNCCGTCGCCGCCNNTGCC	21	2	3	2067639-2067656|2067639-2067656|2067729-2067746	NZ_AP018034.1_397146-397129|NZ_AP018034.1_603413-603396|NZ_AP018034.1_3433019-3433002	NA	5	5	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA|88aa|up_0|NZ_AP018034.1_2066917_2067181_-,NA	NA|165aa|up_9|NZ_AP018034.1_2053267_2053762_+	pfam02577, DNase-RNase, Bifunctional nuclease	NA|226aa|up_8|NZ_AP018034.1_2054109_2054787_+	cd01105, HTH_GlnR-like, Helix-Turn-Helix DNA binding domain of GlnR-like transcription regulators	NA|942aa|up_7|NZ_AP018034.1_2055145_2057971_+	PRK05367, PRK05367, aminomethyl-transferring glycine dehydrogenase	NA|287aa|up_6|NZ_AP018034.1_2058197_2059058_-	PRK03204, PRK03204, haloalkane dehalogenase; Provisional	NA|289aa|up_5|NZ_AP018034.1_2059098_2059965_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|629aa|up_4|NZ_AP018034.1_2059969_2061856_-	TIGR00976, Hypothetical_protein_Rv1835c/MT1883/Mb1866c	NA|678aa|up_3|NZ_AP018034.1_2061871_2063905_-	cd01456, vWA_ywmD_type, VWA ywmD type:Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)	NA|742aa|up_2|NZ_AP018034.1_2064024_2066250_-	PRK02999, PRK02999, malate synthase G; Provisional	NA|132aa|up_1|NZ_AP018034.1_2066525_2066921_-	COG1848, COG1848, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|88aa|up_0|NZ_AP018034.1_2066917_2067181_-	NA	NA|346aa|down_0|NZ_AP018034.1_2068949_2069987_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|456aa|down_1|NZ_AP018034.1_2069986_2071354_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|480aa|down_2|NZ_AP018034.1_2071527_2072967_-	PRK07807, PRK07807, GuaB1 family IMP dehydrogenase-related protein	NA|484aa|down_3|NZ_AP018034.1_2072999_2074451_-	PRK09287, PRK09287, NADP-dependent phosphogluconate dehydrogenase	NA|317aa|down_4|NZ_AP018034.1_2074480_2075431_-	cd07326, M56_BlaR1_MecR1_like, Peptidase M56-like including those in BlaR1 and MecR1, integral membrane metallopeptidase	NA|139aa|down_5|NZ_AP018034.1_2075445_2075862_-	COG3682, COG3682, Predicted transcriptional regulator [Transcription]	NA|141aa|down_6|NZ_AP018034.1_2076139_2076562_+	cd03443, PaaI_thioesterase, PaaI_thioesterase is a tetrameric acyl-CoA thioesterase with a hot dog fold and one of several proteins responsible for phenylacetic acid (PA) degradation in bacteria	NA|101aa|down_7|NZ_AP018034.1_2076610_2076913_+	pfam00547, Urease_gamma, Urease, gamma subunit	NA|105aa|down_8|NZ_AP018034.1_2076909_2077224_+	PRK13202, ureB, urease subunit beta; Reviewed	NA|578aa|down_9|NZ_AP018034.1_2077223_2078957_+	PRK13206, ureC, urease subunit alpha; Reviewed
GCF_002357935.1_ASM235793v1	NZ_AP018034	Mycobacterium tuberculosis strain HN-205	7	2150433-2150652	4	CRISPRCasFinder	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Orphan	GTGCCAGCCGGAATCGTGATCGGCGGAACCGTCACCGACGGAATACTCA	49	0	0	NA	NA	NA	1	1	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA|136aa|up_2|NZ_AP018034.1_2139240_2139648_-,NA|127aa|down_5|NZ_AP018034.1_2158168_2158549_-	NA|155aa|up_9|NZ_AP018034.1_2132568_2133033_-	pfam14081, DUF4262, Domain of unknown function (DUF4262)	NA|741aa|up_8|NZ_AP018034.1_2133208_2135431_-	PRK15061, PRK15061, catalase/peroxidase	NA|148aa|up_7|NZ_AP018034.1_2135468_2135912_-	COG0735, Fur, Fe2+/Zn2+ uptake regulation proteins [Inorganic ion transport and metabolism]	NA|198aa|up_6|NZ_AP018034.1_2136025_2136619_-	COG1881, COG1881, Phospholipid-binding protein [General function prediction only]	NA|202aa|up_5|NZ_AP018034.1_2136701_2137307_-	COG1881, COG1881, Phospholipid-binding protein [General function prediction only]	NA|335aa|up_4|NZ_AP018034.1_2137406_2138411_-	cd08275, MDR3, Medium chain dehydrogenases/reductase (MDR)/zinc-dependent alcohol dehydrogenase-like family	NA|251aa|up_3|NZ_AP018034.1_2138510_2139263_+	cd16282, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	NA|136aa|up_2|NZ_AP018034.1_2139240_2139648_-	NA	NA|767aa|up_1|NZ_AP018034.1_2139782_2142083_+	PLN02892, PLN02892, isocitrate lyase	NA|421aa|up_0|NZ_AP018034.1_2146209_2147471_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|155aa|down_0|NZ_AP018034.1_2152679_2153144_-	cd07821, PYR_PYL_RCAR_like, Pyrabactin resistance 1 (PYR1), PYR1-like (PYL), regulatory component of abscisic acid receptors (RCARs), and related proteins	NA|288aa|down_1|NZ_AP018034.1_2153241_2154105_+	cd07987, LPLAT_MGAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: MGAT-like	NA|424aa|down_2|NZ_AP018034.1_2154142_2155414_-	TIGR01490, Uncharacterized_protein_Rv3661/MT3761, HAD-superfamily subfamily IB hydrolase, TIGR01490	NA|372aa|down_3|NZ_AP018034.1_2155685_2156801_+	COG1680, AmpC, Beta-lactamase class C and other penicillin binding proteins [Defense mechanisms]	NA|447aa|down_4|NZ_AP018034.1_2156791_2158132_+	pfam00144, Beta-lactamase, Beta-lactamase	NA|127aa|down_5|NZ_AP018034.1_2158168_2158549_-	NA	NA|621aa|down_6|NZ_AP018034.1_2158705_2160568_+	PRK12476, PRK12476, putative fatty-acid--CoA ligase; Provisional	NA|160aa|down_7|NZ_AP018034.1_2160575_2161055_-	pfam09167, DUF1942, Domain of unknown function (DUF1942)	NA|258aa|down_8|NZ_AP018034.1_2161291_2162065_+	COG3361, COG3361, Uncharacterized conserved protein [Function unknown]	NA|256aa|down_9|NZ_AP018034.1_2162068_2162836_-	PRK05867, PRK05867, SDR family oxidoreductase
GCF_002357935.1_ASM235793v1	NZ_AP018034	Mycobacterium tuberculosis strain HN-205	8	3106394-3107541	3,5,2	PILER-CR,CRISPRCasFinder,CRT	no	c2c9_V-U4,csm3gr7,csm2gr11,cas10,cas6	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Type III-C,Type III-A,Type III-D,Type III-B	GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC,GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC,GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC	36,36,36	0	0	NA	NA	II-B,III-A:II-B,III-A:II-B,III-A	12,14,15	15	TypeIII-C,TypeIII-A,TypeIII-D,TypeIII-B	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA|85aa|up_8|NZ_AP018034.1_3099673_3099928_-,NA|135aa|up_7|NZ_AP018034.1_3100075_3100480_+,NA|64aa|up_6|NZ_AP018034.1_3100476_3100668_+,NA|86aa|up_4|NZ_AP018034.1_3102254_3102512_+,NA|104aa|up_3|NZ_AP018034.1_3102616_3102928_+,NA|203aa|up_2|NZ_AP018034.1_3103347_3103956_+,NA	NA|92aa|up_9|NZ_AP018034.1_3099222_3099498_+	COG4453, COG4453, Uncharacterized protein conserved in bacteria [Function unknown]	NA|85aa|up_8|NZ_AP018034.1_3099673_3099928_-	NA	NA|135aa|up_7|NZ_AP018034.1_3100075_3100480_+	NA	NA|64aa|up_6|NZ_AP018034.1_3100476_3100668_+	NA	NA|385aa|up_5|NZ_AP018034.1_3100866_3102021_+	pfam00665, rve, Integrase core domain	NA|86aa|up_4|NZ_AP018034.1_3102254_3102512_+	NA	NA|104aa|up_3|NZ_AP018034.1_3102616_3102928_+	NA	NA|203aa|up_2|NZ_AP018034.1_3103347_3103956_+	NA	NA|470aa|up_1|NZ_AP018034.1_3104026_3105436_+	pfam00665, rve, Integrase core domain	NA|271aa|up_0|NZ_AP018034.1_3105432_3106245_+	COG3267, ExeA, Type II secretory pathway, component ExeA (predicted ATPase) [Intracellular trafficking and secretion]	NA|421aa|down_0|NZ_AP018034.1_3107572_3108834_+	PHA02517, PHA02517, putative transposase OrfB; Reviewed	csm3gr7|237aa|down_1|NZ_AP018034.1_3109197_3109908_-	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm2gr11|125aa|down_2|NZ_AP018034.1_3109917_3110292_-	TIGR01870, CRISPR_type_III-associated_protein_Csm2, CRISPR type III-A/MTUBE-associated protein Csm2	cas10|813aa|down_3|NZ_AP018034.1_3110288_3112727_-	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	cas6|241aa|down_4|NZ_AP018034.1_3112723_3113446_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|182aa|down_5|NZ_AP018034.1_3113845_3114391_-	COG4293, COG4293, Uncharacterized protein conserved in bacteria [Function unknown]	NA|295aa|down_6|NZ_AP018034.1_3114662_3115547_-	COG2253, COG2253, Uncharacterized conserved protein [Function unknown]	NA|296aa|down_7|NZ_AP018034.1_3115549_3116437_-	pfam09407, AbiEi_1, AbiEi antitoxin C-terminal domain	NA|182aa|down_8|NZ_AP018034.1_3116741_3117287_-	COG4293, COG4293, Uncharacterized protein conserved in bacteria [Function unknown]	NA|90aa|down_9|NZ_AP018034.1_3117283_3117553_-	COG5552, COG5552, Uncharacterized conserved protein [Function unknown]
GCF_002357935.1_ASM235793v1	NZ_AP018034	Mycobacterium tuberculosis strain HN-205	9	3650399-3650517	6	CRISPRCasFinder	no	cas3	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Unclear	GCCCCTGTGAGTCGAGTGAGCGGAACGAAC	30	1	1	3650429-3650487	NZ_AP018034.1_3650492-3650550	NA	1	1	Unclear	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA,NA|138aa|down_6|NZ_AP018034.1_3656150_3656564_-,NA|126aa|down_7|NZ_AP018034.1_3656598_3656976_-	NA|719aa|up_9|NZ_AP018034.1_3637225_3639382_+	COG2217, ZntA, Cation transport ATPase [Inorganic ion transport and metabolism]	NA|223aa|up_8|NZ_AP018034.1_3639378_3640047_-	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|395aa|up_7|NZ_AP018034.1_3640147_3641332_+	pfam02515, CoA_transf_3, CoA-transferase family III	NA|765aa|up_6|NZ_AP018034.1_3641336_3643631_+	COG0659, SUL1, Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism]	NA|390aa|up_5|NZ_AP018034.1_3643619_3644789_-	cd01158, SCAD_SBCAD, Short chain acyl-CoA dehydrogenases and eukaryotic short/branched chain acyl-CoA dehydrogenases	NA|175aa|up_4|NZ_AP018034.1_3644813_3645338_-	PLN02948, PLN02948, phosphoribosylaminoimidazole carboxylase	NA|430aa|up_3|NZ_AP018034.1_3645334_3646624_-	TIGR01161, N5-carboxyaminoimidazole_ribonucleotide_synthase, phosphoribosylaminoimidazole carboxylase, PurK protein	NA|273aa|up_2|NZ_AP018034.1_3646577_3647396_+	COG2246, COG2246, Predicted membrane protein [Function unknown]	NA|173aa|up_1|NZ_AP018034.1_3647350_3647869_-	pfam03703, bPH_2, Bacterial PH domain	NA|267aa|up_0|NZ_AP018034.1_3647911_3648712_-	COG0340, BirA, Biotin-(acetyl-CoA carboxylase) ligase [Coenzyme metabolism]	NA|223aa|down_0|NZ_AP018034.1_3650783_3651452_+	PRK00148, PRK00148, Maf-like protein; Reviewed	NA|298aa|down_1|NZ_AP018034.1_3651492_3652386_+	COG2897, SseA, Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism]	NA|144aa|down_2|NZ_AP018034.1_3652382_3652814_+	COG2166, sufE, Cysteine desulfurase SufE subunit [Posttranslational modification, protein turnover, chaperones]	NA|601aa|down_3|NZ_AP018034.1_3652921_3654724_+	COG4770, COG4770, Acetyl/propionyl-CoA carboxylase, alpha subunit [Lipid metabolism]	NA|262aa|down_4|NZ_AP018034.1_3654733_3655519_-	PRK07122, PRK07122, RNA polymerase sigma factor SigF; Reviewed	NA|146aa|down_5|NZ_AP018034.1_3655515_3655953_-	COG2172, RsbW, Anti-sigma regulatory factor (Ser/Thr protein kinase) [Signal transduction mechanisms]	NA|138aa|down_6|NZ_AP018034.1_3656150_3656564_-	NA	NA|126aa|down_7|NZ_AP018034.1_3656598_3656976_-	NA	NA|450aa|down_8|NZ_AP018034.1_3657009_3658359_-	PRK08297, PRK08297, L-lysine aminotransferase; Provisional	NA|151aa|down_9|NZ_AP018034.1_3658409_3658862_-	smart00344, HTH_ASNC, helix_turn_helix ASNC type
GCF_002357935.1_ASM235793v1	NZ_AP018034	Mycobacterium tuberculosis strain HN-205	10	3735816-3736392	3	CRT	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Orphan	GCCGCCGTTGCCNCCGNNGCCGCCG	25	0	0	NA	NA	NA	7	7	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA,NA|96aa|down_4|NZ_AP018034.1_3750624_3750912_-,NA|87aa|down_8|NZ_AP018034.1_3764031_3764292_-	NA|147aa|up_9|NZ_AP018034.1_3712900_3713341_+	cd04770, HTH_HMRTR, Helix-Turn-Helix DNA binding domain of Heavy Metal Resistance transcription regulators	NA|290aa|up_8|NZ_AP018034.1_3713374_3714244_-	TIGR00766, Uncharacterized_protein_Dda3937_02003, inner membrane protein YhjD	NA|337aa|up_7|NZ_AP018034.1_3714264_3715275_-	PRK00927, PRK00927, tryptophanyl-tRNA synthetase; Reviewed	NA|215aa|up_6|NZ_AP018034.1_3715547_3716192_+	PRK14875, PRK14875, acetoin dehydrogenase E2 subunit dihydrolipoyllysine-residue acetyltransferase; Provisional	NA|410aa|up_5|NZ_AP018034.1_3716258_3717488_-	PRK08299, PRK08299, NADP-dependent isocitrate dehydrogenase	NA|450aa|up_4|NZ_AP018034.1_3717770_3719120_+	PRK07812, PRK07812, O-acetylhomoserine aminocarboxypropyltransferase; Validated	NA|380aa|up_3|NZ_AP018034.1_3719131_3720271_+	PRK00175, metX, homoserine O-acetyltransferase; Provisional	NA|244aa|up_2|NZ_AP018034.1_3720267_3720999_+	pfam08241, Methyltransf_11, Methyltransferase domain	NA|3690aa|up_1|NZ_AP018034.1_3721007_3732077_-	pfam00823, PPE, PPE family	NA|327aa|up_0|NZ_AP018034.1_3732125_3733106_-	pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex non-fungal	NA|86aa|down_0|NZ_AP018034.1_3738494_3738752_-	pfam11222, DUF3017, Protein of unknown function (DUF3017)	NA|3158aa|down_1|NZ_AP018034.1_3739007_3748481_-	pfam00823, PPE, PPE family	NA|149aa|down_2|NZ_AP018034.1_3749106_3749553_+	pfam01548, DEDD_Tnp_IS110, Transposase	NA|247aa|down_3|NZ_AP018034.1_3749589_3750330_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|96aa|down_4|NZ_AP018034.1_3750624_3750912_-	NA	NA|265aa|down_5|NZ_AP018034.1_3762641_3763436_-	pfam08031, BBE, Berberine and berberine like	NA|124aa|down_6|NZ_AP018034.1_3763517_3763889_-	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|73aa|down_7|NZ_AP018034.1_3763786_3764005_-	pfam01565, FAD_binding_4, FAD binding domain	NA|87aa|down_8|NZ_AP018034.1_3764031_3764292_-	NA	NA|130aa|down_9|NZ_AP018034.1_3764406_3764796_+	pfam05305, DUF732, Protein of unknown function (DUF732)
GCF_002357935.1_ASM235793v1	NZ_AP018034	Mycobacterium tuberculosis strain HN-205	11	3845264-3845353	7	CRISPRCasFinder	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Orphan	CCAGGCGTTGGGCTGGCTGCCGAT	24	0	0	NA	NA	NA	1	1	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA|77aa|up_7|NZ_AP018034.1_3837300_3837531_+,NA|121aa|up_6|NZ_AP018034.1_3837634_3837997_-,NA|73aa|up_2|NZ_AP018034.1_3843329_3843548_-,NA|121aa|down_0|NZ_AP018034.1_3846338_3846701_-,NA|52aa|down_2|NZ_AP018034.1_3848542_3848698_-,NA|52aa|down_3|NZ_AP018034.1_3848722_3848878_-,NA|238aa|down_6|NZ_AP018034.1_3852913_3853627_-	NA|169aa|up_9|NZ_AP018034.1_3835611_3836118_-	COG0802, COG0802, Predicted ATPase or kinase [General function prediction only]	NA|409aa|up_8|NZ_AP018034.1_3836114_3837341_-	PRK00053, alr, alanine racemase; Reviewed	NA|77aa|up_7|NZ_AP018034.1_3837300_3837531_+	NA	NA|121aa|up_6|NZ_AP018034.1_3837634_3837997_-	NA	NA|177aa|up_5|NZ_AP018034.1_3838955_3839486_+	pfam00823, PPE, PPE family	NA|252aa|up_4|NZ_AP018034.1_3839803_3840559_-	COG1484, DnaC, DNA replication protein [DNA replication, recombination, and repair]	NA|421aa|up_3|NZ_AP018034.1_3840642_3841904_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|73aa|up_2|NZ_AP018034.1_3843329_3843548_-	NA	NA|100aa|up_1|NZ_AP018034.1_3844042_3844342_+	pfam00934, PE, PE family	NA|181aa|up_0|NZ_AP018034.1_3844427_3844970_+	COG5651, COG5651, PPE-repeat proteins [Cell motility and secretion]	NA|121aa|down_0|NZ_AP018034.1_3846338_3846701_-	NA	NA|179aa|down_1|NZ_AP018034.1_3846863_3847400_+	pfam00823, PPE, PPE family	NA|52aa|down_2|NZ_AP018034.1_3848542_3848698_-	NA	NA|52aa|down_3|NZ_AP018034.1_3848722_3848878_-	NA	NA|461aa|down_4|NZ_AP018034.1_3850070_3851453_-	TIGR01788, Glutamate_decarboxylase_alpha_GAD-alpha	NA|474aa|down_5|NZ_AP018034.1_3851490_3852912_-	pfam01256, Carb_kinase, Carbohydrate kinase	NA|238aa|down_6|NZ_AP018034.1_3852913_3853627_-	NA	NA|285aa|down_7|NZ_AP018034.1_3853637_3854492_-	pfam14494, DUF4436, Domain of unknown function (DUF4436)	NA|625aa|down_8|NZ_AP018034.1_3854713_3856588_-	PRK00331, PRK00331, isomerizing glutamine--fructose-6-phosphate transaminase	NA|159aa|down_9|NZ_AP018034.1_3856609_3857086_+	pfam10708, DUF2510, Protein of unknown function (DUF2510)
GCF_002357935.1_ASM235793v1	NZ_AP018034	Mycobacterium tuberculosis strain HN-205	12	4109784-4109872	8	CRISPRCasFinder	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	Orphan	GCTCGGCGACGATGCGGGCCGGATGACGGCC	31	0	0	NA	NA	NA	1	1	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,csm3gr7,csm2gr11,cas10,cas6	NA|257aa|up_6|NZ_AP018034.1_4100370_4101141_-,NA|233aa|up_0|NZ_AP018034.1_4108888_4109587_-,NA|126aa|down_6|NZ_AP018034.1_4115107_4115485_+	NA|388aa|up_9|NZ_AP018034.1_4096041_4097205_-	TIGR03819, heli_sec_ATPase, helicase/secretion neighborhood ATPase	NA|351aa|up_8|NZ_AP018034.1_4097201_4098254_-	TIGR03815, CpaE_hom_Actino, helicase/secretion neighborhood CpaE-like protein	NA|288aa|up_7|NZ_AP018034.1_4098752_4099616_+	TIGR01490, Uncharacterized_protein_Rv3661/MT3761, HAD-superfamily subfamily IB hydrolase, TIGR01490	NA|257aa|up_6|NZ_AP018034.1_4100370_4101141_-	NA	NA|549aa|up_5|NZ_AP018034.1_4101137_4102784_-	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]	NA|288aa|up_4|NZ_AP018034.1_4102780_4103644_-	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|309aa|up_3|NZ_AP018034.1_4103636_4104563_-	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|542aa|up_2|NZ_AP018034.1_4104564_4106190_-	cd00995, PBP2_NikA_DppA_OppA_like, The substrate-binding domain of an ABC-type nickel/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|652aa|up_1|NZ_AP018034.1_4106897_4108853_+	PRK00174, PRK00174, acetyl-CoA synthetase; Provisional	NA|233aa|up_0|NZ_AP018034.1_4108888_4109587_-	NA	NA|173aa|down_0|NZ_AP018034.1_4109932_4110451_+	pfam07332, Phage_holin_3_6, Putative Actinobacterial Holin-X, holin superfamily III	NA|328aa|down_1|NZ_AP018034.1_4110451_4111435_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|398aa|down_2|NZ_AP018034.1_4111427_4112621_-	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|274aa|down_3|NZ_AP018034.1_4112626_4113448_-	cd03426, CoAse, Coenzyme A pyrophosphatase (CoAse), a member of the Nudix hydrolase superfamily, functions to catalyze the elimination of oxidized inactive CoA, which can inhibit CoA-utilizing enzymes	NA|228aa|down_4|NZ_AP018034.1_4113579_4114263_-	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|246aa|down_5|NZ_AP018034.1_4114262_4115000_-	COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]	NA|126aa|down_6|NZ_AP018034.1_4115107_4115485_+	NA	NA|225aa|down_7|NZ_AP018034.1_4115583_4116258_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|265aa|down_8|NZ_AP018034.1_4116363_4117158_-	cd16278, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	NA|152aa|down_9|NZ_AP018034.1_4117164_4117620_-	cd02199, YjgF_YER057c_UK114_like_1, This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function
