assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_003006115.1_ASM300611v1	CP022578	Mycobacterium tuberculosis strain WC059 chromosome, complete genome	1	431179-432326	1,1,1	CRT,PILER-CR,CRISPRCasFinder	no	cas6,cas10,csm2gr11,csm3gr7,csm4gr5,c2c9_V-U4	c2c9_V-U4,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csa3,DEDDh,cas4,WYL,DinG,cas3	Type III-A,Type III-D,Type III-B,Type III-C	GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC,GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC,GTCGTCAGACCCAAAACCCCGAGAGGGGACGGAAAC	36,36,36	0	0	NA	NA	II-B,III-A:II-B,III-A:II-B,III-A	15,14,14	15	TypeIII-A,TypeIII-D,TypeIII-B,TypeIII-C	c2c9_V-U4,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csa3,DEDDh,cas4,WYL,DinG,cas3	NA,NA|203aa|down_2|CP022578.1_434763_435372_-,NA|104aa|down_3|CP022578.1_435791_436103_-,NA|86aa|down_4|CP022578.1_436207_436465_-,NA|64aa|down_6|CP022578.1_438051_438243_-,NA|135aa|down_7|CP022578.1_438239_438644_-	NA|182aa|up_9|CP022578.1_421432_421978_+	COG4293, COG4293, Uncharacterized protein conserved in bacteria [Function unknown]	NA|296aa|up_8|CP022578.1_422282_423170_+	pfam09407, AbiEi_1, AbiEi antitoxin C-terminal domain	NA|295aa|up_7|CP022578.1_423172_424057_+	COG2253, COG2253, Uncharacterized conserved protein [Function unknown]	NA|182aa|up_6|CP022578.1_424328_424874_+	COG4293, COG4293, Uncharacterized protein conserved in bacteria [Function unknown]	cas6|315aa|up_5|CP022578.1_425051_425996_+	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas10|813aa|up_4|CP022578.1_425992_428431_+	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	csm2gr11|125aa|up_3|CP022578.1_428427_428802_+	TIGR01870, CRISPR_type_III-associated_protein_Csm2, CRISPR type III-A/MTUBE-associated protein Csm2	csm3gr7|237aa|up_2|CP022578.1_428811_429522_+	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm4gr5|119aa|up_1|CP022578.1_429502_429859_+	COG1567, COG1567, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|421aa|up_0|CP022578.1_429885_431147_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|271aa|down_0|CP022578.1_432474_433287_-	COG3267, ExeA, Type II secretory pathway, component ExeA (predicted ATPase) [Intracellular trafficking and secretion]	NA|470aa|down_1|CP022578.1_433283_434693_-	pfam00665, rve, Integrase core domain	NA|203aa|down_2|CP022578.1_434763_435372_-	NA	NA|104aa|down_3|CP022578.1_435791_436103_-	NA	NA|86aa|down_4|CP022578.1_436207_436465_-	NA	NA|385aa|down_5|CP022578.1_436698_437853_-	pfam00665, rve, Integrase core domain	NA|64aa|down_6|CP022578.1_438051_438243_-	NA	NA|135aa|down_7|CP022578.1_438239_438644_-	NA	NA|122aa|down_8|CP022578.1_438716_439082_-	pfam13673, Acetyltransf_10, Acetyltransferase (GNAT) domain	NA|156aa|down_9|CP022578.1_439221_439689_-	COG4453, COG4453, Uncharacterized protein conserved in bacteria [Function unknown]
GCA_003006115.1_ASM300611v1	CP022578	Mycobacterium tuberculosis strain WC059 chromosome, complete genome	9	3214234-3215281	4	CRT	no		c2c9_V-U4,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csa3,DEDDh,cas4,WYL,DinG,cas3	Orphan	CGGNNCCGGCGGNNNCGGCGG	21	1	5	3215198-3215221|3215198-3215221|3215198-3215221|3215198-3215221|3215198-3215221	CP022578.1_3212192-3212215|CP022578.1_1125030-1125053|CP022578.1_2707641-2707618|CP022578.1_1688317-1688340|CP022578.1_2621247-2621224	NA	16	16	Orphan	c2c9_V-U4,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csa3,DEDDh,cas4,WYL,DinG,cas3	NA|87aa|up_1|CP022578.1_3209312_3209573_-,NA|58aa|down_0|CP022578.1_3215489_3215663_+	NA|98aa|up_9|CP022578.1_3196861_3197155_-	TIGR03930, WXG100_ESAT6, WXG100 family type VII secretion target	NA|514aa|up_8|CP022578.1_3197203_3198745_-	COG5651, COG5651, PPE-repeat proteins [Cell motility and secretion]	NA|103aa|up_7|CP022578.1_3198747_3199056_-	pfam00934, PE, PE family	NA|1331aa|up_6|CP022578.1_3199052_3203045_-	TIGR03924, T7SS_EccC_a, type VII secretion protein EccCa	NA|539aa|up_5|CP022578.1_3203041_3204658_-	pfam05108, T7SS_ESX1_EccB, Type VII secretion system ESX-1, transport TM domain B	NA|632aa|up_4|CP022578.1_3204654_3206550_-	TIGR03922, T7SS_EccA, type VII secretion AAA-ATPase EccA	NA|303aa|up_3|CP022578.1_3206773_3207682_-	COG3315, COG3315, O-Methyltransferase involved in polyketide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism]	NA|537aa|up_2|CP022578.1_3207705_3209316_-	COG5651, COG5651, PPE-repeat proteins [Cell motility and secretion]	NA|87aa|up_1|CP022578.1_3209312_3209573_-	NA	NA|903aa|up_0|CP022578.1_3209606_3212315_+	pfam00934, PE, PE family	NA|58aa|down_0|CP022578.1_3215489_3215663_+	NA	NA|143aa|down_1|CP022578.1_3215686_3216115_+	cd18678, PIN_MtVapC25_VapC33-like, VapC-like PIN domain of Mycobacterium tuberculosis VapC25, VapC33, and related proteins	NA|307aa|down_2|CP022578.1_3216154_3217075_-	COG3662, COG3662, Uncharacterized protein conserved in bacteria [Function unknown]	NA|242aa|down_3|CP022578.1_3217164_3217890_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|194aa|down_4|CP022578.1_3217819_3218401_-	pfam00903, Glyoxalase, Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily	NA|207aa|down_5|CP022578.1_3218497_3219118_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|378aa|down_6|CP022578.1_3219114_3220248_+	pfam12146, Hydrolase_4, Serine aminopeptidase, S33	NA|732aa|down_7|CP022578.1_3220361_3222557_+	cd01152, ACAD_fadE6_17_26, Putative acyl-CoA dehydrogenases similar to fadE6, fadE17, and fadE26	NA|561aa|down_8|CP022578.1_3222573_3224256_-	PRK07788, PRK07788, acyl-CoA synthetase; Validated	NA|398aa|down_9|CP022578.1_3224291_3225485_+	COG3285, COG3285, Predicted eukaryotic-type DNA primase [DNA replication, recombination, and repair]
GCA_003006115.1_ASM300611v1	CP022578	Mycobacterium tuberculosis strain WC059 chromosome, complete genome	10	3848103-3848191	7	CRISPRCasFinder	no		c2c9_V-U4,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csa3,DEDDh,cas4,WYL,DinG,cas3	Orphan	GGCCGTCATCCGGCCCGCATCGTCGCCGAGC	31	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csa3,DEDDh,cas4,WYL,DinG,cas3	NA|126aa|up_6|CP022578.1_3842489_3842867_-,NA|233aa|down_0|CP022578.1_3848387_3849086_+,NA|90aa|down_2|CP022578.1_3851342_3851612_-,NA|257aa|down_7|CP022578.1_3856833_3857604_+,NA|65aa|down_9|CP022578.1_3859304_3859499_-	NA|152aa|up_9|CP022578.1_3840354_3840810_+	cd02199, YjgF_YER057c_UK114_like_1, This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function	NA|265aa|up_8|CP022578.1_3840816_3841611_+	cd16278, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	NA|225aa|up_7|CP022578.1_3841716_3842391_-	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|126aa|up_6|CP022578.1_3842489_3842867_-	NA	NA|246aa|up_5|CP022578.1_3842974_3843712_+	COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]	NA|228aa|up_4|CP022578.1_3843711_3844395_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|274aa|up_3|CP022578.1_3844526_3845348_+	cd03426, CoAse, Coenzyme A pyrophosphatase (CoAse), a member of the Nudix hydrolase superfamily, functions to catalyze the elimination of oxidized inactive CoA, which can inhibit CoA-utilizing enzymes	NA|398aa|up_2|CP022578.1_3845353_3846547_+	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|328aa|up_1|CP022578.1_3846539_3847523_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|173aa|up_0|CP022578.1_3847523_3848042_-	pfam07332, Phage_holin_3_6, Putative Actinobacterial Holin-X, holin superfamily III	NA|233aa|down_0|CP022578.1_3848387_3849086_+	NA	NA|652aa|down_1|CP022578.1_3849121_3851077_-	PRK00174, PRK00174, acetyl-CoA synthetase; Provisional	NA|90aa|down_2|CP022578.1_3851342_3851612_-	NA	NA|542aa|down_3|CP022578.1_3851784_3853410_+	cd00995, PBP2_NikA_DppA_OppA_like, The substrate-binding domain of an ABC-type nickel/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|309aa|down_4|CP022578.1_3853411_3854338_+	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|267aa|down_5|CP022578.1_3854393_3855194_+	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|549aa|down_6|CP022578.1_3855190_3856837_+	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]	NA|257aa|down_7|CP022578.1_3856833_3857604_+	NA	NA|288aa|down_8|CP022578.1_3858358_3859222_-	TIGR01490, Uncharacterized_protein_Rv3661/MT3761, HAD-superfamily subfamily IB hydrolase, TIGR01490	NA|65aa|down_9|CP022578.1_3859304_3859499_-	NA
GCA_003006115.1_ASM300611v1	CP022578	Mycobacterium tuberculosis strain WC059 chromosome, complete genome	11	4009084-4010248	5	CRT	no		c2c9_V-U4,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csa3,DEDDh,cas4,WYL,DinG,cas3	Orphan	GCCGCCGNNNCCGCCGGNGCCGCCNNNGCCGCCGGNNCCGCCG	43	10	10	4009397-4009416|4009460-4009500|4009544-4009572|4009685-4009719|4009763-4009809|4009853-4009878|4009922-4009959|4010003-4010031|4010075-4010106|4010150-4010205	CP022578.1_4010612-4010631|CP022578.1_4010675-4010715|CP022578.1_4010759-4010787|CP022578.1_4010294-4010328|CP022578.1_4010372-4010418|CP022578.1_4010462-4010487|CP022578.1_4010531-4010568|CP022578.1_4010612-4010640|CP022578.1_4010684-4010715|CP022578.1_4010759-4010814	NA	14	14	Orphan	c2c9_V-U4,cas6,cas10,csm2gr11,csm3gr7,csm4gr5,csa3,DEDDh,cas4,WYL,DinG,cas3	NA|280aa|up_3|CP022578.1_4004947_4005787_-,NA|73aa|up_0|CP022578.1_4008390_4008609_+,NA	NA|395aa|up_9|CP022578.1_3998504_3999689_-	PRK08313, PRK08313, thiolase domain-containing protein	NA|355aa|up_8|CP022578.1_3999705_4000770_-	PRK07937, PRK07937, lipid-transfer protein; Provisional	NA|304aa|up_7|CP022578.1_4000785_4001697_-	COG1545, COG1545, Predicted nucleic-acid-binding protein containing a Zn-ribbon [General function prediction only]	NA|348aa|up_6|CP022578.1_4001849_4002893_+	TIGR03559, F420_Rv3520c, probable F420-dependent oxidoreductase, Rv3520c family	NA|237aa|up_5|CP022578.1_4002957_4003668_-	pfam06314, ADC, Acetoacetate decarboxylase (ADC)	NA|399aa|up_4|CP022578.1_4003696_4004893_+	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|280aa|up_3|CP022578.1_4004947_4005787_-	NA	NA|264aa|up_2|CP022578.1_4005882_4006674_-	PRK07799, PRK07799, crotonase/enoyl-CoA hydratase family protein	NA|549aa|up_1|CP022578.1_4006747_4008394_+	PRK07798, PRK07798, acyl-CoA synthetase; Validated	NA|73aa|up_0|CP022578.1_4008390_4008609_+	NA	NA|219aa|down_0|CP022578.1_4013666_4014323_+	PRK07798, PRK07798, acyl-CoA synthetase; Validated	NA|2141aa|down_1|CP022578.1_4014451_4020874_-	pfam00934, PE, PE family	NA|279aa|down_2|CP022578.1_4021233_4022070_+	COG2159, COG2159, Predicted metal-dependent hydrolase of the TIM-barrel fold [General function prediction only]	NA|516aa|down_3|CP022578.1_4022066_4023614_+	PRK07586, PRK07586, acetolactate synthase large subunit	NA|1621aa|down_4|CP022578.1_4023780_4028643_-	pfam00934, PE, PE family	NA|1384aa|down_5|CP022578.1_4028933_4033085_-	pfam00934, PE, PE family	NA|503aa|down_6|CP022578.1_4033255_4034764_-	PRK07867, PRK07867, acyl-CoA synthetase; Validated	NA|374aa|down_7|CP022578.1_4034834_4035956_-	COG1960, CaiA, Acyl-CoA dehydrogenases [Lipid metabolism]	NA|401aa|down_8|CP022578.1_4035980_4037183_-	cd01152, ACAD_fadE6_17_26, Putative acyl-CoA dehydrogenases similar to fadE6, fadE17, and fadE26	NA|64aa|down_9|CP022578.1_4037397_4037589_+	COG1141, Fer, Ferredoxin [Energy production and conversion]
