assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000338715.2_ASM33871v2	NC_020245	Mycobacterium tuberculosis variant bovis BCG str. Korea 1168P, complete sequence	1	332008-332805	1	CRT	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	CCGCCGGNGCCGCCGGNN	18	0	0	NA	NA	NA	15	15	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA,NA	NA|398aa|up_9|NC_020245.2_321701_322895_-	COG3285, COG3285, Predicted eukaryotic-type DNA primase [DNA replication, recombination, and repair]	NA|561aa|up_8|NC_020245.2_322930_324613_+	PRK07788, PRK07788, acyl-CoA synthetase; Validated	NA|732aa|up_7|NC_020245.2_324629_326825_-	cd01152, ACAD_fadE6_17_26, Putative acyl-CoA dehydrogenases similar to fadE6, fadE17, and fadE26	NA|378aa|up_6|NC_020245.2_326938_328072_-	pfam12146, Hydrolase_4, Serine aminopeptidase, S33	NA|207aa|up_5|NC_020245.2_328068_328689_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|194aa|up_4|NC_020245.2_328785_329367_+	pfam00903, Glyoxalase, Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily	NA|242aa|up_3|NC_020245.2_329296_330022_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|307aa|up_2|NC_020245.2_330111_331032_+	COG3662, COG3662, Uncharacterized protein conserved in bacteria [Function unknown]	NA|143aa|up_1|NC_020245.2_331071_331500_-	cd18678, PIN_MtVapC25_VapC33-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC25, VapC33, and related proteins	NA|86aa|up_0|NC_020245.2_331523_331781_-	pfam01402, RHH_1, Ribbon-helix-helix protein, copG family	NA|900aa|down_0|NC_020245.2_334879_337579_-	pfam00934, PE, PE family	NA|835aa|down_1|NC_020245.2_337828_340333_-	pfam00934, PE, PE family	NA|537aa|down_2|NC_020245.2_340623_342234_+	COG5651, COG5651, PPE-repeat proteins [Cell motility and secretion]	NA|303aa|down_3|NC_020245.2_342257_343166_+	COG3315, COG3315, O-Methyltransferase involved in polyketide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism]	NA|632aa|down_4|NC_020245.2_343390_345286_+	TIGR03922, T7SS_EccA, type VII secretion AAA-ATPase EccA	NA|539aa|down_5|NC_020245.2_345282_346899_+	pfam05108, T7SS_ESX1_EccB, Type VII secretion system ESX-1, transport TM domain B	NA|1331aa|down_6|NC_020245.2_346895_350888_+	TIGR03924, T7SS_EccC_a, type VII secretion protein EccCa	NA|103aa|down_7|NC_020245.2_350884_351193_+	pfam00934, PE, PE family	NA|514aa|down_8|NC_020245.2_351195_352737_+	COG5651, COG5651, PPE-repeat proteins [Cell motility and secretion]	NA|98aa|down_9|NC_020245.2_352785_353079_+	TIGR03930, WXG100_ESAT6, WXG100 family type VII secretion target
GCF_000338715.2_ASM33871v2	NC_020245	Mycobacterium tuberculosis variant bovis BCG str. Korea 1168P, complete sequence	2	339364-339586	1	PILER-CR	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	ATCCGCCGCTACCGCCGGTGCCGCCGGCGCCGAACAGCCCGCC	43	0	0	NA	NA	NA	2	2	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA,NA	NA|732aa|up_9|NC_020245.2_324629_326825_-	cd01152, ACAD_fadE6_17_26, Putative acyl-CoA dehydrogenases similar to fadE6, fadE17, and fadE26	NA|378aa|up_8|NC_020245.2_326938_328072_-	pfam12146, Hydrolase_4, Serine aminopeptidase, S33	NA|207aa|up_7|NC_020245.2_328068_328689_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|194aa|up_6|NC_020245.2_328785_329367_+	pfam00903, Glyoxalase, Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily	NA|242aa|up_5|NC_020245.2_329296_330022_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|307aa|up_4|NC_020245.2_330111_331032_+	COG3662, COG3662, Uncharacterized protein conserved in bacteria [Function unknown]	NA|143aa|up_3|NC_020245.2_331071_331500_-	cd18678, PIN_MtVapC25_VapC33-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC25, VapC33, and related proteins	NA|86aa|up_2|NC_020245.2_331523_331781_-	pfam01402, RHH_1, Ribbon-helix-helix protein, copG family	NA|917aa|up_1|NC_020245.2_331865_334616_-	pfam00934, PE, PE family	NA|900aa|up_0|NC_020245.2_334879_337579_-	pfam00934, PE, PE family	NA|537aa|down_0|NC_020245.2_340623_342234_+	COG5651, COG5651, PPE-repeat proteins [Cell motility and secretion]	NA|303aa|down_1|NC_020245.2_342257_343166_+	COG3315, COG3315, O-Methyltransferase involved in polyketide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism]	NA|632aa|down_2|NC_020245.2_343390_345286_+	TIGR03922, T7SS_EccA, type VII secretion AAA-ATPase EccA	NA|539aa|down_3|NC_020245.2_345282_346899_+	pfam05108, T7SS_ESX1_EccB, Type VII secretion system ESX-1, transport TM domain B	NA|1331aa|down_4|NC_020245.2_346895_350888_+	TIGR03924, T7SS_EccC_a, type VII secretion protein EccCa	NA|103aa|down_5|NC_020245.2_350884_351193_+	pfam00934, PE, PE family	NA|514aa|down_6|NC_020245.2_351195_352737_+	COG5651, COG5651, PPE-repeat proteins [Cell motility and secretion]	NA|98aa|down_7|NC_020245.2_352785_353079_+	TIGR03930, WXG100_ESAT6, WXG100 family type VII secretion target	NA|97aa|down_8|NC_020245.2_353108_353399_+	COG4842, COG4842, Uncharacterized protein conserved in bacteria [Function unknown]	NA|296aa|down_9|NC_020245.2_353409_354297_+	pfam14011, ESX-1_EspG, EspG family
GCF_000338715.2_ASM33871v2	NC_020245	Mycobacterium tuberculosis variant bovis BCG str. Korea 1168P, complete sequence	3	693243-693319	1	CRISPRCasFinder	no	c2c9_V-U4	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Type V-U4	TGAGGTGCGGCGTGAGCGCGGGT	23	0	0	NA	NA	NA	1	1	TypeV-U4	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA,NA	NA|136aa|up_9|NC_020245.2_679136_679544_+	cd18696, PIN_MtVapC26-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC26 and related proteins	NA|229aa|up_8|NC_020245.2_679603_680290_-	pfam10738, Lpp-LpqN, Probable lipoprotein LpqN	NA|878aa|up_7|NC_020245.2_680443_683077_+	COG3537, COG3537, Putative alpha-1,2-mannosidase [Carbohydrate transport and metabolism]	NA|796aa|up_6|NC_020245.2_683099_685487_-	pfam03706, LPG_synthase_TM, Lysylphosphatidylglycerol synthase TM region	NA|241aa|up_5|NC_020245.2_685624_686347_+	COG2186, FadR, Transcriptional regulators [Transcription]	NA|266aa|up_4|NC_020245.2_686343_687141_+	COG0767, Ttg2B, ABC-type transport system involved in resistance to organic solvents, permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|296aa|up_3|NC_020245.2_687142_688030_+	COG0767, Ttg2B, ABC-type transport system involved in resistance to organic solvents, permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|405aa|up_2|NC_020245.2_688035_689250_+	pfam11887, Mce4_CUP1, Cholesterol uptake porter CUP1 of Mce4, putative	NA|344aa|up_1|NC_020245.2_689246_690278_+	COG1463, Ttg2C, ABC-type transport system involved in resistance to organic solvents, periplasmic component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|482aa|up_0|NC_020245.2_690274_691720_+	TIGR00996, Mtu_fam_mce, virulence factor Mce family protein	NA|517aa|down_0|NC_020245.2_694454_696005_+	COG1463, Ttg2C, ABC-type transport system involved in resistance to organic solvents, periplasmic component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|131aa|down_1|NC_020245.2_696056_696449_-	cd18768, PIN_MtVapC4-C5-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC4, VapC5, and related proteins	NA|86aa|down_2|NC_020245.2_696445_696703_-	COG4118, Phd, Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]	NA|412aa|down_3|NC_020245.2_696885_698121_-	COG1373, COG1373, Predicted ATPase (AAA+ superfamily) [General function prediction only]	NA|138aa|down_4|NC_020245.2_698371_698785_-	cd18681, PIN_MtVapC27-VapC40_like, VapC-like PIN domain of Mycobacterium tuberculosis VapC27, and VapC40, and related proteins	NA|79aa|down_5|NC_020245.2_698781_699018_-	COG2002, AbrB, Regulators of stationary/sporulation gene expression [Transcription]	NA|169aa|down_6|NC_020245.2_699121_699628_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|157aa|down_7|NC_020245.2_699741_700212_-	PRK10755, PRK10755, two-component system sensor histidine kinase PmrB	NA|254aa|down_8|NC_020245.2_700255_701017_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|104aa|down_9|NC_020245.2_701073_701385_+	pfam03413, PepSY, Peptidase propeptide and YPEB domain
GCF_000338715.2_ASM33871v2	NC_020245	Mycobacterium tuberculosis variant bovis BCG str. Korea 1168P, complete sequence	4	927375-928286	2	CRT	no	csa3	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Type I-A	CGGGGCCGGCGGGGCCGGCGG	21	1	23	928242-928268|928242-928268|928242-928268|928242-928268|928242-928268|928242-928268|928242-928268|928242-928268|928242-928268|928242-928268|928242-928268|928242-928268|928242-928268|928242-928268|928242-928268|928242-928268|928242-928268|928242-928268|928242-928268|928242-928268|928242-928268|928242-928268|928242-928268	NC_020245.2_835302-835328|NC_020245.2_331980-331954|NC_020245.2_334994-334968|NC_020245.2_337940-337914|NC_020245.2_841553-841579|NC_020245.2_842831-842857|NC_020245.2_927108-927134|NC_020245.2_2373343-2373317|NC_020245.2_333216-333190|NC_020245.2_335945-335919|NC_020245.2_336284-336258|NC_020245.2_339059-339033|NC_020245.2_676343-676317|NC_020245.2_838674-838700|NC_020245.2_839772-839798|NC_020245.2_927009-927035|NC_020245.2_1215507-1215533|NC_020245.2_1651018-1650992|NC_020245.2_1840220-1840194|NC_020245.2_2032539-2032513|NC_020245.2_3795426-3795452|NC_020245.2_3924014-3924040|NC_020245.2_3924170-3924196	NA	19	19	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA,NA|184aa|down_2|NC_020245.2_932404_932956_-,NA|81aa|down_8|NC_020245.2_938431_938674_+	NA|685aa|up_9|NC_020245.2_915010_917065_-	TIGR00350, Transcriptional_regulator_LytR, cell envelope-related function transcriptional attenuator common domain	NA|390aa|up_8|NC_020245.2_917230_918400_-	TIGR00737, Probable_tRNA-dihydrouridine_synthase, putative TIM-barrel protein, nifR3 family	NA|339aa|up_7|NC_020245.2_918487_919504_-	cd01050, Acyl_ACP_Desat, Acyl ACP desaturase, ferritin-like diiron-binding domain	NA|214aa|up_6|NC_020245.2_919665_920307_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|352aa|up_5|NC_020245.2_920387_921443_+	COG3662, COG3662, Uncharacterized protein conserved in bacteria [Function unknown]	csa3|131aa|up_4|NC_020245.2_921494_921887_-	smart00418, HTH_ARSR, helix_turn_helix, Arsenical Resistance Operon Repressor	NA|141aa|up_3|NC_020245.2_921944_922367_-	COG0590, CumB, Cytosine/adenosine deaminases [Nucleotide transport and metabolism / Translation, ribosomal structure and biogenesis]	NA|97aa|up_2|NC_020245.2_922328_922619_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|302aa|up_1|NC_020245.2_922723_923629_+	COG3315, COG3315, O-Methyltransferase involved in polyketide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism]	NA|272aa|up_0|NC_020245.2_923647_924463_-	TIGR04255, hypothetical_protein, TIGR04255 family protein	NA|883aa|down_0|NC_020245.2_928663_931312_-	pfam00934, PE, PE family	NA|215aa|down_1|NC_020245.2_931779_932424_+	pfam14032, PknH_C, PknH-like extracellular domain	NA|184aa|down_2|NC_020245.2_932404_932956_-	NA	NA|241aa|down_3|NC_020245.2_933036_933759_-	COG4849, COG4849, Predicted nucleotidyltransferase [General function prediction    only]	NA|343aa|down_4|NC_020245.2_933829_934858_-	COG4861, COG4861, Uncharacterized protein conserved in bacteria [Function unknown]	NA|261aa|down_5|NC_020245.2_935546_936329_+	pfam01427, Peptidase_M15, D-ala-D-ala dipeptidase	NA|271aa|down_6|NC_020245.2_936415_937228_+	pfam13847, Methyltransf_31, Methyltransferase domain	NA|287aa|down_7|NC_020245.2_937295_938156_-	TIGR01250, Proline_iminopeptidase, proline-specific peptidase, Bacillus coagulans-type subfamily	NA|81aa|down_8|NC_020245.2_938431_938674_+	NA	NA|431aa|down_9|NC_020245.2_938950_940243_+	cd17329, MFS_MdtH_MDR_like, Multidrug resistance protein MdtH and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily
GCF_000338715.2_ASM33871v2	NC_020245	Mycobacterium tuberculosis variant bovis BCG str. Korea 1168P, complete sequence	5	929223-929352	2	PILER-CR	no	csa3	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Type I-A	CCGCCGGCTCCGCCGGTGGCGCCGC	25	0	0	NA	NA	NA	2	2	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA,NA|184aa|down_1|NC_020245.2_932404_932956_-,NA|81aa|down_7|NC_020245.2_938431_938674_+	NA|390aa|up_9|NC_020245.2_917230_918400_-	TIGR00737, Probable_tRNA-dihydrouridine_synthase, putative TIM-barrel protein, nifR3 family	NA|339aa|up_8|NC_020245.2_918487_919504_-	cd01050, Acyl_ACP_Desat, Acyl ACP desaturase, ferritin-like diiron-binding domain	NA|214aa|up_7|NC_020245.2_919665_920307_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|352aa|up_6|NC_020245.2_920387_921443_+	COG3662, COG3662, Uncharacterized protein conserved in bacteria [Function unknown]	csa3|131aa|up_5|NC_020245.2_921494_921887_-	smart00418, HTH_ARSR, helix_turn_helix, Arsenical Resistance Operon Repressor	NA|141aa|up_4|NC_020245.2_921944_922367_-	COG0590, CumB, Cytosine/adenosine deaminases [Nucleotide transport and metabolism / Translation, ribosomal structure and biogenesis]	NA|97aa|up_3|NC_020245.2_922328_922619_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|302aa|up_2|NC_020245.2_922723_923629_+	COG3315, COG3315, O-Methyltransferase involved in polyketide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism]	NA|272aa|up_1|NC_020245.2_923647_924463_-	TIGR04255, hypothetical_protein, TIGR04255 family protein	NA|911aa|up_0|NC_020245.2_925704_928437_+	pfam00934, PE, PE family	NA|215aa|down_0|NC_020245.2_931779_932424_+	pfam14032, PknH_C, PknH-like extracellular domain	NA|184aa|down_1|NC_020245.2_932404_932956_-	NA	NA|241aa|down_2|NC_020245.2_933036_933759_-	COG4849, COG4849, Predicted nucleotidyltransferase [General function prediction    only]	NA|343aa|down_3|NC_020245.2_933829_934858_-	COG4861, COG4861, Uncharacterized protein conserved in bacteria [Function unknown]	NA|261aa|down_4|NC_020245.2_935546_936329_+	pfam01427, Peptidase_M15, D-ala-D-ala dipeptidase	NA|271aa|down_5|NC_020245.2_936415_937228_+	pfam13847, Methyltransf_31, Methyltransferase domain	NA|287aa|down_6|NC_020245.2_937295_938156_-	TIGR01250, Proline_iminopeptidase, proline-specific peptidase, Bacillus coagulans-type subfamily	NA|81aa|down_7|NC_020245.2_938431_938674_+	NA	NA|431aa|down_8|NC_020245.2_938950_940243_+	cd17329, MFS_MdtH_MDR_like, Multidrug resistance protein MdtH and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|335aa|down_9|NC_020245.2_940226_941231_+	COG1071, AcoA, Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit [Energy production and conversion]
GCF_000338715.2_ASM33871v2	NC_020245	Mycobacterium tuberculosis variant bovis BCG str. Korea 1168P, complete sequence	6	1213582-1214438	2	CRISPRCasFinder	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	GGCGGTGTCGGCGGTGCCGGCGG	23	4	77	1213965-1213986|1213965-1213986|1214010-1214025|1214148-1214169|1214148-1214169|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283|1214265-1214283	NC_020245.2_2894650-2894629|NC_020245.2_2894716-2894695|NC_020245.2_2059729-2059714|NC_020245.2_839136-839157|NC_020245.2_1219402-1219423|NC_020245.2_335381-335363|NC_020245.2_676007-675989|NC_020245.2_841385-841403|NC_020245.2_1219375-1219393|NC_020245.2_1219447-1219465|NC_020245.2_1625841-1625823|NC_020245.2_1628667-1628649|NC_020245.2_1631525-1631507|NC_020245.2_1632185-1632167|NC_020245.2_1971452-1971434|NC_020245.2_2032713-2032695|NC_020245.2_2033259-2033241|NC_020245.2_2735558-2735540|NC_020245.2_2739078-2739060|NC_020245.2_3917961-3917979|NC_020245.2_150233-150251|NC_020245.2_333372-333354|NC_020245.2_336488-336470|NC_020245.2_336941-336923|NC_020245.2_336992-336974|NC_020245.2_337001-336983|NC_020245.2_339272-339254|NC_020245.2_339659-339641|NC_020245.2_339688-339706|NC_020245.2_339782-339764|NC_020245.2_364359-364341|NC_020245.2_444573-444555|NC_020245.2_547917-547899|NC_020245.2_625207-625225|NC_020245.2_625429-625447|NC_020245.2_675173-675155|NC_020245.2_842333-842351|NC_020245.2_1091791-1091809|NC_020245.2_1092439-1092457|NC_020245.2_1096509-1096491|NC_020245.2_1214637-1214655|NC_020245.2_1215144-1215162|NC_020245.2_1215396-1215414|NC_020245.2_1215414-1215432|NC_020245.2_1219795-1219813|NC_020245.2_1486057-1486039|NC_020245.2_1613803-1613785|NC_020245.2_1614256-1614238|NC_020245.2_1631390-1631372|NC_020245.2_1631399-1631381|NC_020245.2_1839797-1839779|NC_020245.2_1840325-1840307|NC_020245.2_1970726-1970708|NC_020245.2_2059927-2059909|NC_020245.2_2256808-2256790|NC_020245.2_2308063-2308045|NC_020245.2_2369273-2369291|NC_020245.2_2373256-2373238|NC_020245.2_2515939-2515921|NC_020245.2_2628216-2628234|NC_020245.2_2728874-2728856|NC_020245.2_2729456-2729438|NC_020245.2_2977926-2977944|NC_020245.2_2978025-2978043|NC_020245.2_3699543-3699561|NC_020245.2_3730678-3730660|NC_020245.2_3731302-3731284|NC_020245.2_3731545-3731527|NC_020245.2_3731677-3731659|NC_020245.2_3733582-3733564|NC_020245.2_3794007-3794025|NC_020245.2_3794190-3794208|NC_020245.2_3915394-3915412|NC_020245.2_3917133-3917151|NC_020245.2_3917787-3917805|NC_020245.2_3924392-3924410|NC_020245.2_4012868-4012850	NA	16	16	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|89aa|up_3|NC_020245.2_1209014_1209281_+,NA|61aa|down_3|NC_020245.2_1216982_1217165_+	NA|465aa|up_9|NC_020245.2_1203348_1204743_+	TIGR01137, Cystathionine_beta-synthase, cystathionine beta-synthase	NA|241aa|up_8|NC_020245.2_1204944_1205667_+	pfam06271, RDD, RDD family	NA|389aa|up_7|NC_020245.2_1205698_1206865_+	PRK07811, PRK07811, cystathionine gamma-synthase; Provisional	NA|165aa|up_6|NC_020245.2_1206935_1207430_-	PRK00226, greA, transcription elongation factor GreA; Reviewed	NA|145aa|up_5|NC_020245.2_1207615_1208050_-	pfam14155, DUF4307, Domain of unknown function (DUF4307)	NA|289aa|up_4|NC_020245.2_1208151_1209018_+	TIGR03446, mycothiol_Mca, mycothiol conjugate amidase Mca	NA|89aa|up_3|NC_020245.2_1209014_1209281_+	NA	NA|674aa|up_2|NC_020245.2_1209267_1211289_+	COG1331, COG1331, Highly conserved protein containing a thioredoxin domain [Posttranslational modification, protein turnover, chaperones]	NA|243aa|up_1|NC_020245.2_1211387_1212116_-	TIGR01065, Hypothetical_UPF0073_protein_yqfA	NA|263aa|up_0|NC_020245.2_1212226_1213015_+	PRK14828, PRK14828, undecaprenyl pyrophosphate synthase; Provisional	NA|107aa|down_0|NC_020245.2_1215797_1216118_+	COG0020, UppS, Undecaprenyl pyrophosphate synthase [Lipid metabolism]	NA|145aa|down_1|NC_020245.2_1216270_1216705_+	pfam00934, PE, PE family	NA|55aa|down_2|NC_020245.2_1216806_1216971_+	smart00637, CBD_II, CBD_II domain	NA|61aa|down_3|NC_020245.2_1216982_1217165_+	NA	NA|152aa|down_4|NC_020245.2_1217356_1217812_+	pfam01670, Glyco_hydro_12, Glycosyl hydrolase family 12	NA|851aa|down_5|NC_020245.2_1218226_1220779_+	pfam00934, PE, PE family	NA|313aa|down_6|NC_020245.2_1220996_1221935_-	PRK05439, PRK05439, pantothenate kinase; Provisional	NA|427aa|down_7|NC_020245.2_1222322_1223603_+	PRK00011, glyA, serine hydroxymethyltransferase; Reviewed	NA|276aa|down_8|NC_020245.2_1223707_1224535_+	cd01050, Acyl_ACP_Desat, Acyl ACP desaturase, ferritin-like diiron-binding domain	NA|434aa|down_9|NC_020245.2_1224745_1226047_+	COG1875, COG1875, NYN ribonuclease and ATPase of PhoH family domains [General    function prediction only]
GCF_000338715.2_ASM33871v2	NC_020245	Mycobacterium tuberculosis variant bovis BCG str. Korea 1168P, complete sequence	7	2059399-2059653	3	CRT	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	GCCNCCGTCGCCGCCNNTGCC	21	2	3	2059480-2059497|2059480-2059497|2059570-2059587	NC_020245.2_402339-402322|NC_020245.2_608569-608552|NC_020245.2_3373996-3373979	NA	5	5	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|88aa|up_0|NC_020245.2_2058758_2059022_-,NA	NA|165aa|up_9|NC_020245.2_2045108_2045603_+	pfam02577, DNase-RNase, Bifunctional nuclease	NA|226aa|up_8|NC_020245.2_2045950_2046628_+	cd01105, HTH_GlnR-like, Helix-Turn-Helix DNA binding domain of GlnR-like transcription regulators	NA|942aa|up_7|NC_020245.2_2046986_2049812_+	PRK05367, PRK05367, aminomethyl-transferring glycine dehydrogenase	NA|287aa|up_6|NC_020245.2_2050038_2050899_-	PRK03204, PRK03204, haloalkane dehalogenase; Provisional	NA|289aa|up_5|NC_020245.2_2050939_2051806_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|629aa|up_4|NC_020245.2_2051810_2053697_-	TIGR00976, Hypothetical_protein_Rv1835c/MT1883/Mb1866c	NA|678aa|up_3|NC_020245.2_2053712_2055746_-	cd01456, vWA_ywmD_type, VWA ywmD type:Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)	NA|742aa|up_2|NC_020245.2_2055865_2058091_-	PRK02999, PRK02999, malate synthase G; Provisional	NA|132aa|up_1|NC_020245.2_2058366_2058762_-	COG1848, COG1848, Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only]	NA|88aa|up_0|NC_020245.2_2058758_2059022_-	NA	NA|350aa|down_0|NC_020245.2_2060789_2061839_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|456aa|down_1|NC_020245.2_2061838_2063206_-	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|479aa|down_2|NC_020245.2_2063379_2064816_-	PRK07807, PRK07807, GuaB1 family IMP dehydrogenase-related protein	NA|317aa|down_3|NC_020245.2_2066331_2067282_-	cd07326, M56_BlaR1_MecR1_like, Peptidase M56-like including those in BlaR1 and MecR1, integral membrane metallopeptidase	NA|139aa|down_4|NC_020245.2_2067296_2067713_-	COG3682, COG3682, Predicted transcriptional regulator [Transcription]	NA|141aa|down_5|NC_020245.2_2067990_2068413_+	cd03443, PaaI_thioesterase, PaaI_thioesterase is a tetrameric acyl-CoA thioesterase with a hot dog fold and one of several proteins responsible for phenylacetic acid (PA) degradation in bacteria	NA|101aa|down_6|NC_020245.2_2068461_2068764_+	pfam00547, Urease_gamma, Urease, gamma subunit	NA|105aa|down_7|NC_020245.2_2068760_2069075_+	PRK13202, ureB, urease subunit beta; Reviewed	NA|578aa|down_8|NC_020245.2_2069074_2070808_+	PRK13206, ureC, urease subunit alpha; Reviewed	NA|212aa|down_9|NC_020245.2_2070807_2071443_+	COG0830, UreF, Urease accessory protein UreF [Posttranslational modification, protein turnover, chaperones]
GCF_000338715.2_ASM33871v2	NC_020245	Mycobacterium tuberculosis variant bovis BCG str. Korea 1168P, complete sequence	8	2140620-2140745	3	CRISPRCasFinder	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	TGCCAGCCGGAATCGTGATCGGCGGAACCGTCACCGACGGAATACTCA	48	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|136aa|up_2|NC_020245.2_2131105_2131513_-,NA|127aa|down_5|NC_020245.2_2148354_2148735_-	NA|216aa|up_9|NC_020245.2_2124419_2125067_-	pfam14081, DUF4262, Domain of unknown function (DUF4262)	NA|741aa|up_8|NC_020245.2_2125073_2127296_-	PRK15061, PRK15061, catalase/peroxidase	NA|148aa|up_7|NC_020245.2_2127333_2127777_-	COG0735, Fur, Fe2+/Zn2+ uptake regulation proteins [Inorganic ion transport and metabolism]	NA|198aa|up_6|NC_020245.2_2127890_2128484_-	COG1881, COG1881, Phospholipid-binding protein [General function prediction only]	NA|202aa|up_5|NC_020245.2_2128566_2129172_-	COG1881, COG1881, Phospholipid-binding protein [General function prediction only]	NA|335aa|up_4|NC_020245.2_2129271_2130276_-	cd08275, MDR3, Medium chain dehydrogenases/reductase (MDR)/zinc-dependent alcohol dehydrogenase-like family	NA|251aa|up_3|NC_020245.2_2130375_2131128_+	cd16282, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	NA|136aa|up_2|NC_020245.2_2131105_2131513_-	NA	NA|767aa|up_1|NC_020245.2_2131647_2133948_+	PLN02892, PLN02892, isocitrate lyase	NA|1666aa|up_0|NC_020245.2_2134117_2139115_-	pfam00823, PPE, PPE family	NA|155aa|down_0|NC_020245.2_2142865_2143330_-	cd07821, PYR_PYL_RCAR_like, Pyrabactin resistance 1 (PYR1), PYR1-like (PYL), regulatory component of abscisic acid receptors (RCARs), and related proteins	NA|288aa|down_1|NC_020245.2_2143427_2144291_+	cd07987, LPLAT_MGAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: MGAT-like	NA|424aa|down_2|NC_020245.2_2144328_2145600_-	TIGR01490, Uncharacterized_protein_Rv3661/MT3761, HAD-superfamily subfamily IB hydrolase, TIGR01490	NA|372aa|down_3|NC_020245.2_2145871_2146987_+	COG1680, AmpC, Beta-lactamase class C and other penicillin binding proteins [Defense mechanisms]	NA|447aa|down_4|NC_020245.2_2146977_2148318_+	pfam00144, Beta-lactamase, Beta-lactamase	NA|127aa|down_5|NC_020245.2_2148354_2148735_-	NA	NA|621aa|down_6|NC_020245.2_2148891_2150754_+	PRK12476, PRK12476, putative fatty-acid--CoA ligase; Provisional	NA|160aa|down_7|NC_020245.2_2150761_2151241_-	pfam09167, DUF1942, Domain of unknown function (DUF1942)	NA|258aa|down_8|NC_020245.2_2151477_2152251_+	COG3361, COG3361, Uncharacterized conserved protein [Function unknown]	NA|256aa|down_9|NC_020245.2_2152254_2153022_-	PRK05867, PRK05867, SDR family oxidoreductase
GCF_000338715.2_ASM33871v2	NC_020245	Mycobacterium tuberculosis variant bovis BCG str. Korea 1168P, complete sequence	9	3042281-3043708	3,4,4	PILER-CR,CRISPRCasFinder,CRT	no	c2c9_V-U4,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Type III-B,Type III-A,Type III-D,Type III-C	GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC,GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC,GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC	36,36,36	0	0	NA	NA	II-B,III-A:II-B,III-A:II-B,III-A	18,18,19	19	TypeIII-B,TypeIII-A,TypeIII-D,TypeIII-C	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|85aa|up_8|NC_020245.2_3035560_3035815_-,NA|135aa|up_7|NC_020245.2_3035962_3036367_+,NA|64aa|up_6|NC_020245.2_3036363_3036555_+,NA|86aa|up_4|NC_020245.2_3038141_3038399_+,NA|104aa|up_3|NC_020245.2_3038503_3038815_+,NA|203aa|up_2|NC_020245.2_3039234_3039843_+,NA	NA|92aa|up_9|NC_020245.2_3035109_3035385_+	COG4453, COG4453, Uncharacterized protein conserved in bacteria [Function unknown]	NA|85aa|up_8|NC_020245.2_3035560_3035815_-	NA	NA|135aa|up_7|NC_020245.2_3035962_3036367_+	NA	NA|64aa|up_6|NC_020245.2_3036363_3036555_+	NA	NA|385aa|up_5|NC_020245.2_3036753_3037908_+	pfam00665, rve, Integrase core domain	NA|86aa|up_4|NC_020245.2_3038141_3038399_+	NA	NA|104aa|up_3|NC_020245.2_3038503_3038815_+	NA	NA|203aa|up_2|NC_020245.2_3039234_3039843_+	NA	NA|470aa|up_1|NC_020245.2_3039913_3041323_+	pfam00665, rve, Integrase core domain	NA|271aa|up_0|NC_020245.2_3041319_3042132_+	COG3267, ExeA, Type II secretory pathway, component ExeA (predicted ATPase) [Intracellular trafficking and secretion]	NA|421aa|down_0|NC_020245.2_3043734_3044996_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	cas2|114aa|down_1|NC_020245.2_3047249_3047591_-	COG1343, COG1343, CRISPR-associated protein Cas2 [Defense mechanisms]	cas1|339aa|down_2|NC_020245.2_3047591_3048608_-	TIGR00287, CRISPR-associated_endonuclease_Cas1, CRISPR-associated endonuclease Cas1	csm5gr7|376aa|down_3|NC_020245.2_3049864_3050992_-	COG1332, COG1332, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	csm4gr5|303aa|down_4|NC_020245.2_3050988_3051897_-	COG1567, COG1567, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csm3gr7|237aa|down_5|NC_020245.2_3051877_3052588_-	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm2gr11|125aa|down_6|NC_020245.2_3052597_3052972_-	TIGR01870, CRISPR_type_III-associated_protein_Csm2, CRISPR type III-A/MTUBE-associated protein Csm2	cas10|813aa|down_7|NC_020245.2_3052968_3055407_-	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	cas6|241aa|down_8|NC_020245.2_3055403_3056126_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|182aa|down_9|NC_020245.2_3056525_3057071_-	COG4293, COG4293, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_000338715.2_ASM33871v2	NC_020245	Mycobacterium tuberculosis variant bovis BCG str. Korea 1168P, complete sequence	10	3045031-3047201	5,5,4	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Type III-B,Type III-A,Type III-D,Type III-C	GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC,GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC,GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC	36,36,36	0	0	NA	NA	II-B,III-A:II-B,III-A:II-B,III-A	29,29,28	29	TypeIII-B,TypeIII-A,TypeIII-D,TypeIII-C	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|85aa|up_9|NC_020245.2_3035560_3035815_-,NA|135aa|up_8|NC_020245.2_3035962_3036367_+,NA|64aa|up_7|NC_020245.2_3036363_3036555_+,NA|86aa|up_5|NC_020245.2_3038141_3038399_+,NA|104aa|up_4|NC_020245.2_3038503_3038815_+,NA|203aa|up_3|NC_020245.2_3039234_3039843_+,NA	NA|85aa|up_9|NC_020245.2_3035560_3035815_-	NA	NA|135aa|up_8|NC_020245.2_3035962_3036367_+	NA	NA|64aa|up_7|NC_020245.2_3036363_3036555_+	NA	NA|385aa|up_6|NC_020245.2_3036753_3037908_+	pfam00665, rve, Integrase core domain	NA|86aa|up_5|NC_020245.2_3038141_3038399_+	NA	NA|104aa|up_4|NC_020245.2_3038503_3038815_+	NA	NA|203aa|up_3|NC_020245.2_3039234_3039843_+	NA	NA|470aa|up_2|NC_020245.2_3039913_3041323_+	pfam00665, rve, Integrase core domain	NA|271aa|up_1|NC_020245.2_3041319_3042132_+	COG3267, ExeA, Type II secretory pathway, component ExeA (predicted ATPase) [Intracellular trafficking and secretion]	NA|421aa|up_0|NC_020245.2_3043734_3044996_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	cas2|114aa|down_0|NC_020245.2_3047249_3047591_-	COG1343, COG1343, CRISPR-associated protein Cas2 [Defense mechanisms]	cas1|339aa|down_1|NC_020245.2_3047591_3048608_-	TIGR00287, CRISPR-associated_endonuclease_Cas1, CRISPR-associated endonuclease Cas1	csm5gr7|376aa|down_2|NC_020245.2_3049864_3050992_-	COG1332, COG1332, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	csm4gr5|303aa|down_3|NC_020245.2_3050988_3051897_-	COG1567, COG1567, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csm3gr7|237aa|down_4|NC_020245.2_3051877_3052588_-	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm2gr11|125aa|down_5|NC_020245.2_3052597_3052972_-	TIGR01870, CRISPR_type_III-associated_protein_Csm2, CRISPR type III-A/MTUBE-associated protein Csm2	cas10|813aa|down_6|NC_020245.2_3052968_3055407_-	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	cas6|241aa|down_7|NC_020245.2_3055403_3056126_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|182aa|down_8|NC_020245.2_3056525_3057071_-	COG4293, COG4293, Uncharacterized protein conserved in bacteria [Function unknown]	NA|295aa|down_9|NC_020245.2_3057342_3058227_-	COG2253, COG2253, Uncharacterized conserved protein [Function unknown]
GCF_000338715.2_ASM33871v2	NC_020245	Mycobacterium tuberculosis variant bovis BCG str. Korea 1168P, complete sequence	11	3930510-3931254	6	CRT	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	GCNNCGGCGGNNCCGGCGGNNNCGGCGG	28	2	3	3930628-3930674|3930703-3930728|3930703-3930728	NC_020245.2_3930277-3930323|NC_020245.2_3917889-3917914|NC_020245.2_3930352-3930377	NA	10	10	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA,NA|280aa|down_2|NC_020245.2_3934569_3935409_+	NA|374aa|up_9|NC_020245.2_3908599_3909721_+	COG1960, CaiA, Acyl-CoA dehydrogenases [Lipid metabolism]	NA|503aa|up_8|NC_020245.2_3909791_3911300_+	PRK07867, PRK07867, acyl-CoA synthetase; Validated	NA|1373aa|up_7|NC_020245.2_3911470_3915589_+	pfam00934, PE, PE family	NA|1489aa|up_6|NC_020245.2_3915879_3920346_+	pfam00934, PE, PE family	NA|516aa|up_5|NC_020245.2_3920512_3922060_-	PRK07586, PRK07586, acetolactate synthase large subunit	NA|279aa|up_4|NC_020245.2_3922056_3922893_-	COG2159, COG2159, Predicted metal-dependent hydrolase of the TIM-barrel fold [General function prediction only]	NA|126aa|up_3|NC_020245.2_3925401_3925779_+	PRK12270, kgd, multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit	NA|672aa|up_2|NC_020245.2_3925812_3927828_+	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|172aa|up_1|NC_020245.2_3927824_3928340_+	PHA03169, PHA03169, hypothetical protein; Provisional	NA|219aa|up_0|NC_020245.2_3928468_3929125_-	PRK07798, PRK07798, acyl-CoA synthetase; Validated	NA|549aa|down_0|NC_020245.2_3931962_3933609_-	PRK07798, PRK07798, acyl-CoA synthetase; Validated	NA|264aa|down_1|NC_020245.2_3933682_3934474_+	PRK07799, PRK07799, crotonase/enoyl-CoA hydratase family protein	NA|280aa|down_2|NC_020245.2_3934569_3935409_+	NA	NA|237aa|down_3|NC_020245.2_3936687_3937398_+	pfam06314, ADC, Acetoacetate decarboxylase (ADC)	NA|348aa|down_4|NC_020245.2_3937462_3938506_-	TIGR03559, F420_Rv3520c, probable F420-dependent oxidoreductase, Rv3520c family	NA|304aa|down_5|NC_020245.2_3938658_3939570_+	COG1545, COG1545, Predicted nucleic-acid-binding protein containing a Zn-ribbon [General function prediction only]	NA|355aa|down_6|NC_020245.2_3939585_3940650_+	PRK07937, PRK07937, lipid-transfer protein; Provisional	NA|395aa|down_7|NC_020245.2_3940666_3941851_+	PRK08313, PRK08313, thiolase domain-containing protein	NA|344aa|down_8|NC_020245.2_3941892_3942924_+	cd14952, NHL_PKND_like, NHL repeat domain of the protein kinase PknD	NA|175aa|down_9|NC_020245.2_3942937_3943462_-	COG0663, PaaY, Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily [General function prediction only]
GCF_000338715.2_ASM33871v2	NC_020245	Mycobacterium tuberculosis variant bovis BCG str. Korea 1168P, complete sequence	12	4069374-4069699	6	CRISPRCasFinder	no	cas3	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Unclear	CGCCGGGCTGTTCGGCGACGGCGGC	25	1	34	4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420|4069399-4069420	NC_020245.2_970581-970560|NC_020245.2_1839898-1839877|NC_020245.2_2373714-2373693|NC_020245.2_3730677-3730656|NC_020245.2_3929552-3929573|NC_020245.2_335194-335173|NC_020245.2_335380-335359|NC_020245.2_336487-336466|NC_020245.2_336991-336970|NC_020245.2_339085-339064|NC_020245.2_673732-673711|NC_020245.2_675172-675151|NC_020245.2_841233-841254|NC_020245.2_929161-929140|NC_020245.2_970101-970080|NC_020245.2_1091792-1091813|NC_020245.2_1190751-1190730|NC_020245.2_1193276-1193255|NC_020245.2_1485405-1485384|NC_020245.2_1486056-1486035|NC_020245.2_1632184-1632163|NC_020245.2_1970872-1970851|NC_020245.2_1972102-1972081|NC_020245.2_2033066-2033045|NC_020245.2_2738053-2738032|NC_020245.2_3086355-3086376|NC_020245.2_3733581-3733560|NC_020245.2_3772106-3772127|NC_020245.2_3919084-3919105|NC_020245.2_3919408-3919429|NC_020245.2_3924459-3924480|NC_020245.2_3926553-3926574|NC_020245.2_4012768-4012747|NC_020245.2_4012867-4012846	NA	5	5	Unclear	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|64aa|up_9|NC_020245.2_4056426_4056618_+,NA|193aa|up_5|NC_020245.2_4062876_4063455_-,NA|100aa|down_1|NC_020245.2_4070358_4070658_-,NA|257aa|down_8|NC_020245.2_4076623_4077394_-	NA|64aa|up_9|NC_020245.2_4056426_4056618_+	NA	NA|402aa|up_8|NC_020245.2_4056782_4057988_-	PRK07940, PRK07940, DNA polymerase III subunit delta'; Validated	NA|550aa|up_7|NC_020245.2_4058073_4059723_+	cd07302, CHD, cyclase homology domain	NA|935aa|up_6|NC_020245.2_4059719_4062524_-	PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated	NA|193aa|up_5|NC_020245.2_4062876_4063455_-	NA	NA|68aa|up_4|NC_020245.2_4063594_4063798_-	COG1278, CspC, Cold shock proteins [Transcription]	cas3|772aa|up_3|NC_020245.2_4064047_4066363_+	TIGR03817, DECH_helic, helicase/secretion neighborhood putative DEAH-box helicase	NA|95aa|up_2|NC_020245.2_4066499_4066784_+	pfam00934, PE, PE family	NA|346aa|up_1|NC_020245.2_4067107_4068145_+	pfam18621, DUF5628, Family of unknown function (DUF5628)	NA|105aa|up_0|NC_020245.2_4068898_4069213_+	pfam00934, PE, PE family	NA|126aa|down_0|NC_020245.2_4070018_4070396_-	TIGR03816, tadE_like_DECH, helicase/secretion neighborhood TadE-like protein	NA|100aa|down_1|NC_020245.2_4070358_4070658_-	NA	NA|69aa|down_2|NC_020245.2_4070681_4070888_-	pfam14029, DUF4244, Protein of unknown function (DUF4244)	NA|192aa|down_3|NC_020245.2_4070897_4071473_-	COG2064, TadC, Flp pilus assembly protein TadC [Cell motility and secretion / Intracellular trafficking and secretion]	NA|267aa|down_4|NC_020245.2_4071496_4072297_-	COG4965, TadB, Flp pilus assembly protein TadB [Intracellular trafficking and secretion]	NA|388aa|down_5|NC_020245.2_4072293_4073457_-	TIGR03819, heli_sec_ATPase, helicase/secretion neighborhood ATPase	NA|351aa|down_6|NC_020245.2_4073453_4074506_-	TIGR03815, CpaE_hom_Actino, helicase/secretion neighborhood CpaE-like protein	NA|288aa|down_7|NC_020245.2_4075005_4075869_+	TIGR01490, Uncharacterized_protein_Rv3661/MT3761, HAD-superfamily subfamily IB hydrolase, TIGR01490	NA|257aa|down_8|NC_020245.2_4076623_4077394_-	NA	NA|549aa|down_9|NC_020245.2_4077390_4079037_-	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]
GCF_000338715.2_ASM33871v2	NC_020245	Mycobacterium tuberculosis variant bovis BCG str. Korea 1168P, complete sequence	13	4069776-4069854	7	CRISPRCasFinder	no	cas3	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Unclear	CGCCGGGCTGTTCGGCGACGGCGGC	25	0	0	NA	NA	NA	1	1	Unclear	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|64aa|up_9|NC_020245.2_4056426_4056618_+,NA|193aa|up_5|NC_020245.2_4062876_4063455_-,NA|100aa|down_1|NC_020245.2_4070358_4070658_-,NA|257aa|down_8|NC_020245.2_4076623_4077394_-	NA|64aa|up_9|NC_020245.2_4056426_4056618_+	NA	NA|402aa|up_8|NC_020245.2_4056782_4057988_-	PRK07940, PRK07940, DNA polymerase III subunit delta'; Validated	NA|550aa|up_7|NC_020245.2_4058073_4059723_+	cd07302, CHD, cyclase homology domain	NA|935aa|up_6|NC_020245.2_4059719_4062524_-	PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated	NA|193aa|up_5|NC_020245.2_4062876_4063455_-	NA	NA|68aa|up_4|NC_020245.2_4063594_4063798_-	COG1278, CspC, Cold shock proteins [Transcription]	cas3|772aa|up_3|NC_020245.2_4064047_4066363_+	TIGR03817, DECH_helic, helicase/secretion neighborhood putative DEAH-box helicase	NA|95aa|up_2|NC_020245.2_4066499_4066784_+	pfam00934, PE, PE family	NA|346aa|up_1|NC_020245.2_4067107_4068145_+	pfam18621, DUF5628, Family of unknown function (DUF5628)	NA|105aa|up_0|NC_020245.2_4068898_4069213_+	pfam00934, PE, PE family	NA|126aa|down_0|NC_020245.2_4070018_4070396_-	TIGR03816, tadE_like_DECH, helicase/secretion neighborhood TadE-like protein	NA|100aa|down_1|NC_020245.2_4070358_4070658_-	NA	NA|69aa|down_2|NC_020245.2_4070681_4070888_-	pfam14029, DUF4244, Protein of unknown function (DUF4244)	NA|192aa|down_3|NC_020245.2_4070897_4071473_-	COG2064, TadC, Flp pilus assembly protein TadC [Cell motility and secretion / Intracellular trafficking and secretion]	NA|267aa|down_4|NC_020245.2_4071496_4072297_-	COG4965, TadB, Flp pilus assembly protein TadB [Intracellular trafficking and secretion]	NA|388aa|down_5|NC_020245.2_4072293_4073457_-	TIGR03819, heli_sec_ATPase, helicase/secretion neighborhood ATPase	NA|351aa|down_6|NC_020245.2_4073453_4074506_-	TIGR03815, CpaE_hom_Actino, helicase/secretion neighborhood CpaE-like protein	NA|288aa|down_7|NC_020245.2_4075005_4075869_+	TIGR01490, Uncharacterized_protein_Rv3661/MT3761, HAD-superfamily subfamily IB hydrolase, TIGR01490	NA|257aa|down_8|NC_020245.2_4076623_4077394_-	NA	NA|549aa|down_9|NC_020245.2_4077390_4079037_-	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]
GCF_000338715.2_ASM33871v2	NC_020245	Mycobacterium tuberculosis variant bovis BCG str. Korea 1168P, complete sequence	14	4086037-4086125	8	CRISPRCasFinder	no		RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	Orphan	GCTCGGCGACGATGCGGGCCGGATGACGGCC	31	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,cas6	NA|257aa|up_6|NC_020245.2_4076623_4077394_-,NA|233aa|up_0|NC_020245.2_4085141_4085840_-,NA|126aa|down_6|NC_020245.2_4091360_4091738_+	NA|388aa|up_9|NC_020245.2_4072293_4073457_-	TIGR03819, heli_sec_ATPase, helicase/secretion neighborhood ATPase	NA|351aa|up_8|NC_020245.2_4073453_4074506_-	TIGR03815, CpaE_hom_Actino, helicase/secretion neighborhood CpaE-like protein	NA|288aa|up_7|NC_020245.2_4075005_4075869_+	TIGR01490, Uncharacterized_protein_Rv3661/MT3761, HAD-superfamily subfamily IB hydrolase, TIGR01490	NA|257aa|up_6|NC_020245.2_4076623_4077394_-	NA	NA|549aa|up_5|NC_020245.2_4077390_4079037_-	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]	NA|288aa|up_4|NC_020245.2_4079033_4079897_-	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|309aa|up_3|NC_020245.2_4079889_4080816_-	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|542aa|up_2|NC_020245.2_4080817_4082443_-	cd00995, PBP2_NikA_DppA_OppA_like, The substrate-binding domain of an ABC-type nickel/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|652aa|up_1|NC_020245.2_4083150_4085106_+	PRK00174, PRK00174, acetyl-CoA synthetase; Provisional	NA|233aa|up_0|NC_020245.2_4085141_4085840_-	NA	NA|173aa|down_0|NC_020245.2_4086185_4086704_+	pfam07332, Phage_holin_3_6, Putative Actinobacterial Holin-X, holin superfamily III	NA|328aa|down_1|NC_020245.2_4086704_4087688_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|398aa|down_2|NC_020245.2_4087680_4088874_-	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|274aa|down_3|NC_020245.2_4088879_4089701_-	cd03426, CoAse, Coenzyme A pyrophosphatase (CoAse), a member of the Nudix hydrolase superfamily, functions to catalyze the elimination of oxidized inactive CoA, which can inhibit CoA-utilizing enzymes	NA|228aa|down_4|NC_020245.2_4089832_4090516_-	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|246aa|down_5|NC_020245.2_4090515_4091253_-	COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]	NA|126aa|down_6|NC_020245.2_4091360_4091738_+	NA	NA|225aa|down_7|NC_020245.2_4091836_4092511_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|265aa|down_8|NC_020245.2_4092616_4093411_-	cd16278, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	NA|152aa|down_9|NC_020245.2_4093417_4093873_-	cd02199, YjgF_YER057c_UK114_like_1, This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function
