assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_002886585.1_ASM288658v1	CP025608	Mycobacterium tuberculosis strain GG-229-10 chromosome, complete genome	11	3119166-3120519	1,8,4	PILER-CR,CRISPRCasFinder,CRT	no	c2c9_V-U4,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10	Type III-A,Type III-D,Type III-B,Type III-C	GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC,GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC,GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC	36,36,36	0	0	NA	NA	II-B,III-A:II-B,III-A:II-B,III-A	16,17,18	18	TypeIII-A,TypeIII-D,TypeIII-B,TypeIII-C	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10	NA|135aa|up_7|CP025608.1_3112847_3113252_+,NA|64aa|up_6|CP025608.1_3113248_3113440_+,NA|86aa|up_4|CP025608.1_3115026_3115284_+,NA|104aa|up_3|CP025608.1_3115388_3115700_+,NA|203aa|up_2|CP025608.1_3116119_3116728_+,NA	NA|92aa|up_9|CP025608.1_3111994_3112270_+	COG4453, COG4453, Uncharacterized protein conserved in bacteria [Function unknown]	NA|122aa|up_8|CP025608.1_3112409_3112775_+	pfam13673, Acetyltransf_10, Acetyltransferase (GNAT) domain	NA|135aa|up_7|CP025608.1_3112847_3113252_+	NA	NA|64aa|up_6|CP025608.1_3113248_3113440_+	NA	NA|385aa|up_5|CP025608.1_3113638_3114793_+	pfam00665, rve, Integrase core domain	NA|86aa|up_4|CP025608.1_3115026_3115284_+	NA	NA|104aa|up_3|CP025608.1_3115388_3115700_+	NA	NA|203aa|up_2|CP025608.1_3116119_3116728_+	NA	NA|470aa|up_1|CP025608.1_3116798_3118208_+	pfam00665, rve, Integrase core domain	NA|271aa|up_0|CP025608.1_3118204_3119017_+	COG3267, ExeA, Type II secretory pathway, component ExeA (predicted ATPase) [Intracellular trafficking and secretion]	NA|421aa|down_0|CP025608.1_3120545_3121807_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	cas2|114aa|down_1|CP025608.1_3123609_3123951_-	COG1343, COG1343, CRISPR-associated protein Cas2 [Defense mechanisms]	cas1|339aa|down_2|CP025608.1_3123951_3124968_-	TIGR00287, CRISPR-associated_endonuclease_Cas1, CRISPR-associated endonuclease Cas1	csm6|383aa|down_3|CP025608.1_3124980_3126129_-	cd09699, Csm6_III-A, CRISPR/Cas system-associated protein Csm6	csm5gr7|376aa|down_4|CP025608.1_3126224_3127352_-	COG1332, COG1332, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	csm4gr5|303aa|down_5|CP025608.1_3127348_3128257_-	COG1567, COG1567, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csm3gr7|237aa|down_6|CP025608.1_3128237_3128948_-	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm2gr11|125aa|down_7|CP025608.1_3128957_3129332_-	TIGR01870, CRISPR_type_III-associated_protein_Csm2, CRISPR type III-A/MTUBE-associated protein Csm2	cas10|816aa|down_8|CP025608.1_3129328_3131776_-	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	NA|182aa|down_9|CP025608.1_3132893_3133439_-	COG4293, COG4293, Uncharacterized protein conserved in bacteria [Function unknown]
GCA_002886585.1_ASM288658v1	CP025608	Mycobacterium tuberculosis strain GG-229-10 chromosome, complete genome	12	3121842-3123561	9,5,2	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10	Type III-A,Type III-D,Type III-B,Type III-C	GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC,GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC,GTTTCCGTCCCCTCTCGGGGTTTTGGGTCTGACGAC	36,36,36	0	0	NA	NA	II-B,III-A:II-B,III-A:II-B,III-A	23,23,22	23	TypeIII-A,TypeIII-D,TypeIII-B,TypeIII-C	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10	NA|135aa|up_8|CP025608.1_3112847_3113252_+,NA|64aa|up_7|CP025608.1_3113248_3113440_+,NA|86aa|up_5|CP025608.1_3115026_3115284_+,NA|104aa|up_4|CP025608.1_3115388_3115700_+,NA|203aa|up_3|CP025608.1_3116119_3116728_+,NA	NA|122aa|up_9|CP025608.1_3112409_3112775_+	pfam13673, Acetyltransf_10, Acetyltransferase (GNAT) domain	NA|135aa|up_8|CP025608.1_3112847_3113252_+	NA	NA|64aa|up_7|CP025608.1_3113248_3113440_+	NA	NA|385aa|up_6|CP025608.1_3113638_3114793_+	pfam00665, rve, Integrase core domain	NA|86aa|up_5|CP025608.1_3115026_3115284_+	NA	NA|104aa|up_4|CP025608.1_3115388_3115700_+	NA	NA|203aa|up_3|CP025608.1_3116119_3116728_+	NA	NA|470aa|up_2|CP025608.1_3116798_3118208_+	pfam00665, rve, Integrase core domain	NA|271aa|up_1|CP025608.1_3118204_3119017_+	COG3267, ExeA, Type II secretory pathway, component ExeA (predicted ATPase) [Intracellular trafficking and secretion]	NA|421aa|up_0|CP025608.1_3120545_3121807_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	cas2|114aa|down_0|CP025608.1_3123609_3123951_-	COG1343, COG1343, CRISPR-associated protein Cas2 [Defense mechanisms]	cas1|339aa|down_1|CP025608.1_3123951_3124968_-	TIGR00287, CRISPR-associated_endonuclease_Cas1, CRISPR-associated endonuclease Cas1	csm6|383aa|down_2|CP025608.1_3124980_3126129_-	cd09699, Csm6_III-A, CRISPR/Cas system-associated protein Csm6	csm5gr7|376aa|down_3|CP025608.1_3126224_3127352_-	COG1332, COG1332, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	csm4gr5|303aa|down_4|CP025608.1_3127348_3128257_-	COG1567, COG1567, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	csm3gr7|237aa|down_5|CP025608.1_3128237_3128948_-	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm2gr11|125aa|down_6|CP025608.1_3128957_3129332_-	TIGR01870, CRISPR_type_III-associated_protein_Csm2, CRISPR type III-A/MTUBE-associated protein Csm2	cas10|816aa|down_7|CP025608.1_3129328_3131776_-	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	NA|182aa|down_8|CP025608.1_3132893_3133439_-	COG4293, COG4293, Uncharacterized protein conserved in bacteria [Function unknown]	NA|295aa|down_9|CP025608.1_3133710_3134595_-	COG2253, COG2253, Uncharacterized conserved protein [Function unknown]
GCA_002886585.1_ASM288658v1	CP025608	Mycobacterium tuberculosis strain GG-229-10 chromosome, complete genome	13	3740686-3741446	6	CRT	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10	Orphan	CCGCCGNTNCCNCCGTNNCCGCC	23	0	0	NA	NA	NA	10	10	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10	NA|160aa|up_9|CP025608.1_3720759_3721239_+,NA|164aa|down_2|CP025608.1_3753222_3753714_-,NA|79aa|down_5|CP025608.1_3755328_3755565_-	NA|160aa|up_9|CP025608.1_3720759_3721239_+	NA	NA|147aa|up_8|CP025608.1_3721257_3721698_+	cd04770, HTH_HMRTR, Helix-Turn-Helix DNA binding domain of Heavy Metal Resistance transcription regulators	NA|290aa|up_7|CP025608.1_3721731_3722601_-	TIGR00766, Uncharacterized_protein_Dda3937_02003, inner membrane protein YhjD	NA|337aa|up_6|CP025608.1_3722621_3723632_-	PRK00927, PRK00927, tryptophanyl-tRNA synthetase; Reviewed	NA|211aa|up_5|CP025608.1_3723916_3724549_+	PRK14875, PRK14875, acetoin dehydrogenase E2 subunit dihydrolipoyllysine-residue acetyltransferase; Provisional	NA|410aa|up_4|CP025608.1_3724615_3725845_-	PRK08299, PRK08299, NADP-dependent isocitrate dehydrogenase	NA|450aa|up_3|CP025608.1_3726127_3727477_+	PRK07812, PRK07812, O-acetylhomoserine aminocarboxypropyltransferase; Validated	NA|380aa|up_2|CP025608.1_3727488_3728628_+	PRK00175, metX, homoserine O-acetyltransferase; Provisional	NA|244aa|up_1|CP025608.1_3728624_3729356_+	pfam08241, Methyltransf_11, Methyltransferase domain	NA|2524aa|up_0|CP025608.1_3729364_3736936_-	pfam00823, PPE, PPE family	NA|86aa|down_0|CP025608.1_3743198_3743456_-	pfam11222, DUF3017, Protein of unknown function (DUF3017)	NA|3158aa|down_1|CP025608.1_3743711_3753185_-	pfam00823, PPE, PPE family	NA|164aa|down_2|CP025608.1_3753222_3753714_-	NA	NA|149aa|down_3|CP025608.1_3753810_3754257_+	pfam01548, DEDD_Tnp_IS110, Transposase	NA|273aa|down_4|CP025608.1_3754293_3755112_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|79aa|down_5|CP025608.1_3755328_3755565_-	NA	NA|3717aa|down_6|CP025608.1_3755952_3767103_-	pfam00823, PPE, PPE family	NA|265aa|down_7|CP025608.1_3767346_3768141_-	pfam08031, BBE, Berberine and berberine like	NA|124aa|down_8|CP025608.1_3768222_3768594_-	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|137aa|down_9|CP025608.1_3768491_3768902_-	pfam01565, FAD_binding_4, FAD binding domain
GCA_002886585.1_ASM288658v1	CP025608	Mycobacterium tuberculosis strain GG-229-10 chromosome, complete genome	15	4110663-4110751	10	CRISPRCasFinder	no		csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10	Orphan	GCTCGGCGACGATGCGGGCCGGATGACGGCC	31	0	0	NA	NA	NA	1	1	Orphan	csa3,c2c9_V-U4,cas3,DinG,WYL,cas4,DEDDh,cas2,cas1,csm6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10	NA|65aa|up_9|CP025608.1_4099354_4099549_+,NA|257aa|up_7|CP025608.1_4101249_4102020_-,NA|90aa|up_2|CP025608.1_4107241_4107511_+,NA|233aa|up_0|CP025608.1_4109767_4110466_-,NA|126aa|down_6|CP025608.1_4115986_4116364_+	NA|65aa|up_9|CP025608.1_4099354_4099549_+	NA	NA|288aa|up_8|CP025608.1_4099631_4100495_+	TIGR01490, Uncharacterized_protein_Rv3661/MT3761, HAD-superfamily subfamily IB hydrolase, TIGR01490	NA|257aa|up_7|CP025608.1_4101249_4102020_-	NA	NA|549aa|up_6|CP025608.1_4102016_4103663_-	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]	NA|267aa|up_5|CP025608.1_4103659_4104460_-	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|309aa|up_4|CP025608.1_4104515_4105442_-	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|542aa|up_3|CP025608.1_4105443_4107069_-	cd00995, PBP2_NikA_DppA_OppA_like, The substrate-binding domain of an ABC-type nickel/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|90aa|up_2|CP025608.1_4107241_4107511_+	NA	NA|652aa|up_1|CP025608.1_4107776_4109732_+	PRK00174, PRK00174, acetyl-CoA synthetase; Provisional	NA|233aa|up_0|CP025608.1_4109767_4110466_-	NA	NA|173aa|down_0|CP025608.1_4110811_4111330_+	pfam07332, Phage_holin_3_6, Putative Actinobacterial Holin-X, holin superfamily III	NA|328aa|down_1|CP025608.1_4111330_4112314_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|398aa|down_2|CP025608.1_4112306_4113500_-	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|274aa|down_3|CP025608.1_4113505_4114327_-	cd03426, CoAse, Coenzyme A pyrophosphatase (CoAse), a member of the Nudix hydrolase superfamily, functions to catalyze the elimination of oxidized inactive CoA, which can inhibit CoA-utilizing enzymes	NA|228aa|down_4|CP025608.1_4114458_4115142_-	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|246aa|down_5|CP025608.1_4115141_4115879_-	COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]	NA|126aa|down_6|CP025608.1_4115986_4116364_+	NA	NA|225aa|down_7|CP025608.1_4116462_4117137_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|265aa|down_8|CP025608.1_4117242_4118037_-	cd16278, metallo-hydrolase-like_MBL-fold, uncharacterized subgroup of the MBL-fold_metallo-hydrolase superfamily; MBL-fold metallo hydrolase domain	NA|152aa|down_9|CP025608.1_4118043_4118499_-	cd02199, YjgF_YER057c_UK114_like_1, This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function
