assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002166795.1_ASM216679v1	NZ_CP019221	Mycobacterium chimaera strain CDC 2015-22-71, complete genome	1	513575-513663	1	CRISPRCasFinder	no	DEDDh	csa3,DEDDh,cas3,RT,DinG,cas4,WYL,c2c9_V-U4	Unclear	CCCTGGGGACCCGGCCCCGGCGG	23	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh,cas3,RT,DinG,cas4,WYL,c2c9_V-U4	NA|141aa|up_8|NZ_CP019221.1_503398_503821_-,NA|491aa|down_7|NZ_CP019221.1_520779_522252_-	NA|204aa|up_9|NZ_CP019221.1_502780_503392_+	PRK00076, recR, recombination protein RecR; Reviewed	NA|141aa|up_8|NZ_CP019221.1_503398_503821_-	NA	NA|236aa|up_7|NZ_CP019221.1_503833_504541_-	COG3442, COG3442, Predicted glutamine amidotransferase [General function prediction only]	NA|410aa|up_6|NZ_CP019221.1_504533_505763_-	COG0769, MurE, UDP-N-acetylmuramyl tripeptide synthase [Cell envelope biogenesis, outer membrane]	DEDDh|331aa|up_5|NZ_CP019221.1_505873_506866_+	PRK06063, PRK06063, DEDDh family exonuclease	NA|640aa|up_4|NZ_CP019221.1_506882_508802_-	PRK03739, PRK03739, 2-isopropylmalate synthase; Validated	NA|422aa|up_3|NZ_CP019221.1_509087_510353_+	PRK06635, PRK06635, aspartate kinase; Reviewed	NA|353aa|up_2|NZ_CP019221.1_510353_511412_+	PRK14874, PRK14874, aspartate-semialdehyde dehydrogenase; Provisional	NA|381aa|up_1|NZ_CP019221.1_511408_512551_+	pfam13810, DUF4185, Domain of unknown function (DUF4185)	NA|151aa|up_0|NZ_CP019221.1_512645_513098_+	pfam04592, SelP_N, Selenoprotein P, N terminal region	NA|219aa|down_0|NZ_CP019221.1_513815_514472_+	pfam14032, PknH_C, PknH-like extracellular domain	NA|434aa|down_1|NZ_CP019221.1_514651_515953_+	TIGR03444, EgtA_Cys_ligase, ergothioneine biosynthesis glutamate--cysteine ligase EgtA	NA|438aa|down_2|NZ_CP019221.1_515949_517263_+	TIGR03440, egtB_TIGR03440, ergothioneine biosynthesis protein EgtB	NA|237aa|down_3|NZ_CP019221.1_517262_517973_+	TIGR03442, TIGR03442, ergothioneine biosynthesis protein EgtC	NA|322aa|down_4|NZ_CP019221.1_517972_518938_+	COG4301, COG4301, Uncharacterized conserved protein [Function unknown]	NA|386aa|down_5|NZ_CP019221.1_518934_520092_+	TIGR04343, pyridoxal-phosphate-dependent_transferase, ergothioneine biosynthesis PLP-dependent enzyme EgtE	NA|227aa|down_6|NZ_CP019221.1_520088_520769_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|491aa|down_7|NZ_CP019221.1_520779_522252_-	NA	NA|509aa|down_8|NZ_CP019221.1_522387_523914_+	PRK00047, glpK, glycerol kinase GlpK	NA|240aa|down_9|NZ_CP019221.1_523946_524666_-	COG1695, COG1695, Predicted transcriptional regulators [Transcription]
GCF_002166795.1_ASM216679v1	NZ_CP019221	Mycobacterium chimaera strain CDC 2015-22-71, complete genome	2	1805854-1805932	2	CRISPRCasFinder	no		csa3,DEDDh,cas3,RT,DinG,cas4,WYL,c2c9_V-U4	Orphan	TCCGCCGCGGAGGTCCGTGGGCGCC	25	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh,cas3,RT,DinG,cas4,WYL,c2c9_V-U4	NA,NA	NA|589aa|up_9|NZ_CP019221.1_1792540_1794307_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|186aa|up_8|NZ_CP019221.1_1794488_1795046_+	pfam12079, DUF3558, Protein of unknown function (DUF3558)	NA|177aa|up_7|NZ_CP019221.1_1795054_1795585_+	pfam12079, DUF3558, Protein of unknown function (DUF3558)	NA|168aa|up_6|NZ_CP019221.1_1795679_1796183_-	COG2062, SixA, Phosphohistidine phosphatase SixA [Signal transduction mechanisms]	NA|384aa|up_5|NZ_CP019221.1_1796346_1797498_+	COG0420, SbcD, DNA repair exonuclease [DNA replication, recombination, and repair]	NA|874aa|up_4|NZ_CP019221.1_1797494_1800116_+	COG4717, COG4717, Uncharacterized conserved protein [Function unknown]	NA|415aa|up_3|NZ_CP019221.1_1800211_1801456_+	pfam00561, Abhydrolase_1, alpha/beta hydrolase fold	NA|545aa|up_2|NZ_CP019221.1_1801442_1803077_-	cd08501, PBP2_Lpqw, The substrate-binding domain of mycobacterial lipoprotein Lpqw contains type 2 periplasmic binding fold	NA|612aa|up_1|NZ_CP019221.1_1803111_1804947_-	PRK10261, PRK10261, glutathione transporter ATP-binding protein; Provisional	NA|303aa|up_0|NZ_CP019221.1_1804943_1805852_-	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|326aa|down_0|NZ_CP019221.1_1805974_1806952_-	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|164aa|down_1|NZ_CP019221.1_1807164_1807656_+	cd03379, beta_CA_cladeD, Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|147aa|down_2|NZ_CP019221.1_1807734_1808175_-	cd03443, PaaI_thioesterase, PaaI_thioesterase is a tetrameric acyl-CoA thioesterase with a hot dog fold and one of several proteins responsible for phenylacetic acid (PA) degradation in bacteria	NA|310aa|down_3|NZ_CP019221.1_1808287_1809217_+	PRK05253, PRK05253, sulfate adenylyltransferase subunit CysD	NA|614aa|down_4|NZ_CP019221.1_1809216_1811058_+	PRK05506, PRK05506, bifunctional sulfate adenylyltransferase subunit 1/adenylylsulfate kinase protein; Provisional	NA|162aa|down_5|NZ_CP019221.1_1811116_1811602_+	TIGR00738, Putative_HTH-type_transcriptional_regulator, Rrf2 family protein	NA|130aa|down_6|NZ_CP019221.1_1811619_1812009_-	cd08351, ChaP_like, ChaP, an enzyme involved in the biosynthesis of the antitumor agent chartreusin (cha), and similar proteins	NA|138aa|down_7|NZ_CP019221.1_1812052_1812466_-	cd06587, VOC, vicinal oxygen chelate (VOC) family	NA|293aa|down_8|NZ_CP019221.1_1812532_1813411_+	pfam00296, Bac_luciferase, Luciferase-like monooxygenase	NA|289aa|down_9|NZ_CP019221.1_1813480_1814347_+	pfam13424, TPR_12, Tetratricopeptide repeat
GCF_002166795.1_ASM216679v1	NZ_CP019221	Mycobacterium chimaera strain CDC 2015-22-71, complete genome	3	2786260-2786373	3	CRISPRCasFinder	no		csa3,DEDDh,cas3,RT,DinG,cas4,WYL,c2c9_V-U4	Orphan	GGTGTGATCGCGGCTTCGCCGCTCAACA	28	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh,cas3,RT,DinG,cas4,WYL,c2c9_V-U4	NA,NA|90aa|down_3|NZ_CP019221.1_2789665_2789935_-,NA|101aa|down_5|NZ_CP019221.1_2791108_2791411_+	NA|494aa|up_9|NZ_CP019221.1_2777355_2778837_+	PRK00421, murC, UDP-N-acetylmuramate--L-alanine ligase; Provisional	NA|317aa|up_8|NZ_CP019221.1_2778833_2779784_+	COG1589, FtsQ, Cell division septal protein [Cell envelope biogenesis, outer membrane]	NA|386aa|up_7|NZ_CP019221.1_2780050_2781208_+	PRK09330, PRK09330, cell division protein FtsZ; Validated	NA|249aa|up_6|NZ_CP019221.1_2781270_2782017_+	COG1496, yfiH, Multicopper polyphenol oxidase (laccase) [Secondary metabolites biosynthesis, transport and catabolism]	NA|259aa|up_5|NZ_CP019221.1_2782022_2782799_+	COG0325, COG0325, Predicted enzyme with a TIM-barrel fold [General function prediction only]	NA|215aa|up_4|NZ_CP019221.1_2782865_2783510_+	COG1799, COG1799, Uncharacterized protein conserved in bacteria [Function unknown]	NA|97aa|up_3|NZ_CP019221.1_2783675_2783966_+	COG0762, COG0762, Predicted integral membrane protein [Function unknown]	NA|266aa|up_2|NZ_CP019221.1_2784229_2785027_+	COG3599, DivIVA, Cell division initiation protein [Cell division and chromosome partitioning]	NA|113aa|up_1|NZ_CP019221.1_2785120_2785459_+	pfam02322, Cyt_bd_oxida_II, Cytochrome bd terminal oxidase subunit II	NA|225aa|up_0|NZ_CP019221.1_2785462_2786137_-	COG1926, COG1926, Predicted phosphoribosyltransferases [General function prediction only]	NA|452aa|down_0|NZ_CP019221.1_2786643_2787999_+	PRK07906, PRK07906, hypothetical protein; Provisional	NA|177aa|down_1|NZ_CP019221.1_2788035_2788566_+	COG1881, COG1881, Phospholipid-binding protein [General function prediction only]	NA|367aa|down_2|NZ_CP019221.1_2788568_2789669_-	PRK05286, PRK05286, quinone-dependent dihydroorotate dehydrogenase	NA|90aa|down_3|NZ_CP019221.1_2789665_2789935_-	NA	NA|341aa|down_4|NZ_CP019221.1_2789931_2790954_-	PRK07003, PRK07003, DNA polymerase III subunit gamma/tau	NA|101aa|down_5|NZ_CP019221.1_2791108_2791411_+	NA	NA|251aa|down_6|NZ_CP019221.1_2791485_2792238_+	TIGR03848, MSMEG_4193, probable phosphomutase, MSMEG_4193 family	NA|196aa|down_7|NZ_CP019221.1_2792299_2792887_+	TIGR03847, TIGR03847, conserved hypothetical protein	NA|263aa|down_8|NZ_CP019221.1_2792897_2793686_+	TIGR03843, Phosphatidylinositol_3-_and_4-kinase, conserved hypothetical protein	NA|256aa|down_9|NZ_CP019221.1_2793761_2794529_+	COG1218, CysQ, 3'-Phosphoadenosine 5'-phosphosulfate (PAPS) 3'-phosphatase [Inorganic ion transport and metabolism]
GCF_002166795.1_ASM216679v1	NZ_CP019221	Mycobacterium chimaera strain CDC 2015-22-71, complete genome	4	4287146-4287242	4	CRISPRCasFinder	no		csa3,DEDDh,cas3,RT,DinG,cas4,WYL,c2c9_V-U4	Orphan	TGCCCGGCGTCGGCTGTGGCTCCGG	25	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh,cas3,RT,DinG,cas4,WYL,c2c9_V-U4	NA,NA|153aa|down_4|NZ_CP019221.1_4293348_4293807_-	NA|301aa|up_9|NZ_CP019221.1_4277652_4278555_-	PRK03170, PRK03170, dihydrodipicolinate synthase; Provisional	NA|251aa|up_8|NZ_CP019221.1_4278684_4279437_-	PRK00847, thyX, FAD-dependent thymidylate synthase; Reviewed	NA|404aa|up_7|NZ_CP019221.1_4279486_4280698_-	COG3214, COG3214, Uncharacterized protein conserved in bacteria [Function unknown]	NA|245aa|up_6|NZ_CP019221.1_4280750_4281485_-	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|162aa|up_5|NZ_CP019221.1_4281584_4282070_-	pfam00186, DHFR_1, Dihydrofolate reductase	NA|267aa|up_4|NZ_CP019221.1_4282141_4282942_-	PRK01827, thyA, thymidylate synthase; Reviewed	NA|250aa|up_3|NZ_CP019221.1_4282984_4283734_+	pfam01738, DLH, Dienelactone hydrolase family	NA|371aa|up_2|NZ_CP019221.1_4283750_4284863_-	COG1960, CaiA, Acyl-CoA dehydrogenases [Lipid metabolism]	NA|400aa|up_1|NZ_CP019221.1_4284859_4286059_-	cd01152, ACAD_fadE6_17_26, Putative acyl-CoA dehydrogenases similar to fadE6, fadE17, and fadE26	NA|261aa|up_0|NZ_CP019221.1_4286055_4286838_-	PRK07231, FabG-like, SDR family oxidoreductase	NA|1079aa|down_0|NZ_CP019221.1_4287265_4290502_-	PRK12999, PRK12999, pyruvate carboxylase; Reviewed	NA|509aa|down_1|NZ_CP019221.1_4290498_4292025_-	cd17631, FACL_FadD13-like, fatty acyl-CoA synthetase, including FadD13	NA|277aa|down_2|NZ_CP019221.1_4292073_4292904_+	pfam13622, 4HBT_3, Thioesterase-like superfamily	NA|154aa|down_3|NZ_CP019221.1_4292894_4293356_-	COG0655, WrbA, Multimeric flavodoxin WrbA [General function prediction only]	NA|153aa|down_4|NZ_CP019221.1_4293348_4293807_-	NA	NA|246aa|down_5|NZ_CP019221.1_4293803_4294541_-	PRK00048, PRK00048, dihydrodipicolinate reductase; Provisional	NA|453aa|down_6|NZ_CP019221.1_4294645_4296004_+	pfam02720, DUF222, Domain of unknown function (DUF222)	NA|361aa|down_7|NZ_CP019221.1_4296010_4297093_-	COG3804, COG3804, Uncharacterized conserved protein related to dihydrodipicolinate reductase [Function unknown]	NA|336aa|down_8|NZ_CP019221.1_4297089_4298097_-	cd03531, Rieske_RO_Alpha_KSH, The alignment model represents the N-terminal rieske iron-sulfur domain of KshA, the oxygenase component of 3-ketosteroid 9-alpha-hydroxylase (KSH)	NA|217aa|down_9|NZ_CP019221.1_4298218_4298869_+	TIGR03384, betaine_BetI, transcriptional repressor BetI
GCF_002166795.1_ASM216679v1	NZ_CP019221	Mycobacterium chimaera strain CDC 2015-22-71, complete genome	5	4880923-4881056	5	CRISPRCasFinder	no		csa3,DEDDh,cas3,RT,DinG,cas4,WYL,c2c9_V-U4	Orphan	GGGCGGTACCCCCGGCGCGGAATGCATCGTCGCCGGGCTAC	41	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh,cas3,RT,DinG,cas4,WYL,c2c9_V-U4	NA,NA|79aa|down_5|NZ_CP019221.1_4889941_4890178_+	NA|118aa|up_9|NZ_CP019221.1_4868042_4868396_-	cd20610, Nitro_FMN_reductase_child_20, nitroreductase family protein	NA|217aa|up_8|NZ_CP019221.1_4868415_4869066_-	TIGR03401, Uncharacterized_protein_YFL061W/YNL335W, HD domain protein, cyanamide hydratase family	NA|315aa|up_7|NZ_CP019221.1_4869203_4870148_+	COG4977, COG4977, Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain [Transcription]	NA|993aa|up_6|NZ_CP019221.1_4870151_4873130_-	PRK00068, PRK00068, hypothetical protein; Validated	NA|341aa|up_5|NZ_CP019221.1_4873203_4874226_-	COG3480, SdrC, Predicted secreted protein containing a PDZ domain [Signal transduction mechanisms]	NA|453aa|up_4|NZ_CP019221.1_4874350_4875709_+	COG5282, COG5282, Uncharacterized conserved protein [Function unknown]	NA|289aa|up_3|NZ_CP019221.1_4875744_4876611_+	TIGR03882, hypothetical_protein, bacteriocin biosynthesis cyclodehydratase domain	NA|448aa|up_2|NZ_CP019221.1_4876692_4878036_+	COG0661, AarF, Predicted unusual protein kinase [General function prediction only]	NA|84aa|up_1|NZ_CP019221.1_4878045_4878297_-	pfam02467, Whib, Transcription factor WhiB	NA|707aa|up_0|NZ_CP019221.1_4878701_4880822_-	COG0210, UvrD, Superfamily I DNA and RNA helicases [DNA replication, recombination, and repair]	NA|85aa|down_0|NZ_CP019221.1_4881067_4881322_+	TIGR02200, conserved_hypothetical_protein, Glutaredoxin-like protein	NA|311aa|down_1|NZ_CP019221.1_4881333_4882266_-	PRK00241, nudC, NAD(+) diphosphatase	NA|357aa|down_2|NZ_CP019221.1_4882265_4883336_-	pfam02254, TrkA_N, TrkA-N domain	NA|1116aa|down_3|NZ_CP019221.1_4883384_4886732_-	COG0210, UvrD, Superfamily I DNA and RNA helicases [DNA replication, recombination, and repair]	NA|1042aa|down_4|NZ_CP019221.1_4886728_4889854_-	COG0210, UvrD, Superfamily I DNA and RNA helicases [DNA replication, recombination, and repair]	NA|79aa|down_5|NZ_CP019221.1_4889941_4890178_+	NA	NA|262aa|down_6|NZ_CP019221.1_4890223_4891009_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|104aa|down_7|NZ_CP019221.1_4891011_4891323_+	COG3695, COG3695, Predicted methylated DNA-protein cysteine methyltransferase [DNA replication, recombination, and repair]	NA|285aa|down_8|NZ_CP019221.1_4891329_4892184_-	TIGR02569, conserved_hypothetical_protein, TIGR02569 family protein	NA|396aa|down_9|NZ_CP019221.1_4892219_4893407_-	PRK07878, PRK07878, molybdopterin biosynthesis-like protein MoeZ; Validated
