assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_010731675.1_ASM1073167v1	NZ_AP022584	Mycobacterium marseillense strain JCM 17324	1	838222-838341	1	CRISPRCasFinder	no		csa3,DEDDh,cas4,WYL,cas3,DinG	Orphan	GTCGCATGCCAGCGGCTGTGGGCCGACGCCGAGCGGTCGGCAGAC	45	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh,cas4,WYL,cas3,DinG	NA|224aa|up_9|NZ_AP022584.1_827996_828668_-,NA|203aa|up_7|NZ_AP022584.1_830068_830677_-,NA|205aa|down_4|NZ_AP022584.1_841062_841677_+	NA|224aa|up_9|NZ_AP022584.1_827996_828668_-	NA	NA|273aa|up_8|NZ_AP022584.1_829161_829980_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|203aa|up_7|NZ_AP022584.1_830068_830677_-	NA	NA|161aa|up_6|NZ_AP022584.1_831122_831605_+	cd07812, SRPBCC, START/RHO_alpha_C/PITP/Bet_v1/CoxG/CalC (SRPBCC) ligand-binding domain superfamily	NA|290aa|up_5|NZ_AP022584.1_831653_832523_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|143aa|up_4|NZ_AP022584.1_833948_834377_-	pfam00553, CBM_2, Cellulose binding domain	NA|262aa|up_3|NZ_AP022584.1_834508_835294_-	COG3971, COG3971, 2-keto-4-pentenoate hydratase [Secondary metabolites biosynthesis, transport, and catabolism]	NA|194aa|up_2|NZ_AP022584.1_835476_836058_-	pfam01872, RibD_C, RibD C-terminal domain	NA|217aa|up_1|NZ_AP022584.1_836376_837027_+	pfam01083, Cutinase, Cutinase	NA|275aa|up_0|NZ_AP022584.1_837161_837986_-	TIGR03666, Rv2061_F420, PPOX class probable F420-dependent enzyme, Rv2061 family	NA|218aa|down_0|NZ_AP022584.1_838589_839243_-	COG4185, COG4185, Uncharacterized protein conserved in bacteria [Function unknown]	NA|119aa|down_1|NZ_AP022584.1_839256_839613_-	TIGR01552, Hypothetical_protein_Rv3357/MT3465/Mb3392	NA|189aa|down_2|NZ_AP022584.1_839803_840370_+	COG0545, FkpA, FKBP-type peptidyl-prolyl cis-trans isomerases 1 [Posttranslational modification, protein turnover, chaperones]	NA|148aa|down_3|NZ_AP022584.1_840399_840843_-	pfam12680, SnoaL_2, SnoaL-like domain	NA|205aa|down_4|NZ_AP022584.1_841062_841677_+	NA	NA|254aa|down_5|NZ_AP022584.1_841707_842469_-	PRK06500, PRK06500, SDR family oxidoreductase	NA|153aa|down_6|NZ_AP022584.1_842533_842992_-	pfam13924, Lipocalin_5, Lipocalin-like domain	NA|228aa|down_7|NZ_AP022584.1_843264_843948_+	pfam11139, SfLAP, Sap, sulfolipid-1-addressing protein	NA|256aa|down_8|NZ_AP022584.1_843999_844767_+	pfam08241, Methyltransf_11, Methyltransferase domain	NA|412aa|down_9|NZ_AP022584.1_844763_845999_-	TIGR03106, trio_M42_hydro, hydrolase, peptidase M42 family
GCF_010731675.1_ASM1073167v1	NZ_AP022584	Mycobacterium marseillense strain JCM 17324	2	1458118-1458257	2	CRISPRCasFinder	no		csa3,DEDDh,cas4,WYL,cas3,DinG	Orphan	CCGCATCGGCCTCCGGCTCGCCG	23	1	1	1458141-1458177	NZ_AP022584.1_1458285-1458321	NA	2	2	Orphan	csa3,DEDDh,cas4,WYL,cas3,DinG	NA|104aa|up_4|NZ_AP022584.1_1452894_1453206_-,NA	NA|196aa|up_9|NZ_AP022584.1_1446309_1446897_+	COG3247, HdeD, Uncharacterized conserved protein [Function unknown]	NA|491aa|up_8|NZ_AP022584.1_1446983_1448456_+	pfam01654, Cyt_bd_oxida_I, Cytochrome bd terminal oxidase subunit I	NA|347aa|up_7|NZ_AP022584.1_1448497_1449538_+	pfam02322, Cyt_bd_oxida_II, Cytochrome bd terminal oxidase subunit II	NA|538aa|up_6|NZ_AP022584.1_1449600_1451214_+	COG4988, CydD, ABC-type transport system involved in cytochrome bd biosynthesis, ATPase and permease components [Energy production and conversion / Posttranslational modification, protein turnover, chaperones]	NA|559aa|up_5|NZ_AP022584.1_1451200_1452877_+	TIGR02868, ABC_transporter_ATP-binding_protein, thiol reductant ABC exporter, CydC subunit	NA|104aa|up_4|NZ_AP022584.1_1452894_1453206_-	NA	NA|466aa|up_3|NZ_AP022584.1_1453384_1454782_-	COG2898, COG2898, Uncharacterized conserved protein [Function unknown]	NA|282aa|up_2|NZ_AP022584.1_1454925_1455771_-	COG1946, TesB, Acyl-CoA thioesterase [Lipid metabolism]	NA|473aa|up_1|NZ_AP022584.1_1455902_1457321_-	PRK06247, PRK06247, pyruvate kinase; Provisional	NA|154aa|up_0|NZ_AP022584.1_1457386_1457848_-	pfam10825, DUF2752, Protein of unknown function (DUF2752)	NA|272aa|down_0|NZ_AP022584.1_1459443_1460259_-	PRK13111, trpA, tryptophan synthase subunit alpha; Provisional	NA|420aa|down_1|NZ_AP022584.1_1460255_1461515_-	PRK04346, PRK04346, tryptophan synthase subunit beta; Validated	NA|273aa|down_2|NZ_AP022584.1_1461589_1462408_-	PRK00278, trpC, indole-3-glycerol phosphate synthase TrpC	NA|229aa|down_3|NZ_AP022584.1_1462520_1463207_-	TIGR02234, conserved_hypothetical_protein, trp region conserved hypothetical membrane protein	NA|513aa|down_4|NZ_AP022584.1_1463199_1464738_-	PRK13571, PRK13571, anthranilate synthase component I; Provisional	NA|153aa|down_5|NZ_AP022584.1_1464832_1465291_+	cd03017, PRX_BCP, Peroxiredoxin (PRX) family, Bacterioferritin comigratory protein (BCP) subfamily; composed of  thioredoxin-dependent thiol peroxidases, widely expressed in pathogenic bacteria, that protect cells against toxicity from reactive oxygen species by reducing and detoxifying hydroperoxides	NA|370aa|down_6|NZ_AP022584.1_1465297_1466407_-	COG0387, ChaA, Ca2+/H+ antiporter [Inorganic ion transport and metabolism]	NA|116aa|down_7|NZ_AP022584.1_1466483_1466831_-	PRK00051, hisI, phosphoribosyl-AMP cyclohydrolase; Reviewed	NA|263aa|down_8|NZ_AP022584.1_1466827_1467616_-	PRK02083, PRK02083, imidazole glycerol phosphate synthase subunit HisF; Provisional	NA|267aa|down_9|NZ_AP022584.1_1467617_1468418_-	COG0483, SuhB, Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family [Carbohydrate transport and metabolism]
GCF_010731675.1_ASM1073167v1	NZ_AP022584	Mycobacterium marseillense strain JCM 17324	3	4219711-4219792	3	CRISPRCasFinder	no	DEDDh	csa3,DEDDh,cas4,WYL,cas3,DinG	Unclear	GCAGCGGGTCACCCGCATCGATT	23	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh,cas4,WYL,cas3,DinG	NA|141aa|up_2|NZ_AP022584.1_4217331_4217754_-,NA	NA|620aa|up_9|NZ_AP022584.1_4210399_4212259_+	PRK14952, PRK14952, DNA polymerase III subunits gamma/tau	NA|421aa|up_8|NZ_AP022584.1_4212272_4213535_-	pfam02353, CMAS, Mycolic acid cyclopropane synthetase	NA|475aa|up_7|NZ_AP022584.1_4213579_4215004_-	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|146aa|up_6|NZ_AP022584.1_4215052_4215490_+	pfam10604, Polyketide_cyc2, Polyketide cyclase / dehydrase and lipid transport	NA|242aa|up_5|NZ_AP022584.1_4215495_4216221_-	COG0860, AmiC, N-acetylmuramoyl-L-alanine amidase [Cell envelope biogenesis, outer membrane]	NA|115aa|up_4|NZ_AP022584.1_4216355_4216700_+	PRK00153, PRK00153, YbaB/EbfC family nucleoid-associated protein	NA|204aa|up_3|NZ_AP022584.1_4216713_4217325_+	PRK00076, recR, recombination protein RecR; Reviewed	NA|141aa|up_2|NZ_AP022584.1_4217331_4217754_-	NA	NA|236aa|up_1|NZ_AP022584.1_4217765_4218473_-	COG3442, COG3442, Predicted glutamine amidotransferase [General function prediction only]	NA|410aa|up_0|NZ_AP022584.1_4218465_4219695_-	COG0769, MurE, UDP-N-acetylmuramyl tripeptide synthase [Cell envelope biogenesis, outer membrane]	DEDDh|331aa|down_0|NZ_AP022584.1_4219854_4220847_+	PRK06063, PRK06063, DEDDh family exonuclease	NA|602aa|down_1|NZ_AP022584.1_4220863_4222669_-	PRK03739, PRK03739, 2-isopropylmalate synthase; Validated	NA|422aa|down_2|NZ_AP022584.1_4222954_4224220_+	PRK06635, PRK06635, aspartate kinase; Reviewed	NA|345aa|down_3|NZ_AP022584.1_4224220_4225255_+	PRK14874, PRK14874, aspartate-semialdehyde dehydrogenase; Provisional	NA|334aa|down_4|NZ_AP022584.1_4225418_4226420_+	pfam13810, DUF4185, Domain of unknown function (DUF4185)	NA|110aa|down_5|NZ_AP022584.1_4226465_4226795_-	pfam05305, DUF732, Protein of unknown function (DUF732)	NA|142aa|down_6|NZ_AP022584.1_4226994_4227420_+	pfam04592, SelP_N, Selenoprotein P, N terminal region	NA|174aa|down_7|NZ_AP022584.1_4227561_4228083_+	pfam06682, SARAF, SOCE-associated regulatory factor of calcium homoeostasis	NA|221aa|down_8|NZ_AP022584.1_4228211_4228874_+	pfam14032, PknH_C, PknH-like extracellular domain	NA|437aa|down_9|NZ_AP022584.1_4229052_4230363_+	TIGR03444, EgtA_Cys_ligase, ergothioneine biosynthesis glutamate--cysteine ligase EgtA
GCF_010731675.1_ASM1073167v1	NZ_AP022584	Mycobacterium marseillense strain JCM 17324	4	5405394-5405474	4	CRISPRCasFinder	no		csa3,DEDDh,cas4,WYL,cas3,DinG	Orphan	GCGGCGGAGCCGGGCGCGGCGGGTCG	26	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh,cas4,WYL,cas3,DinG	NA|89aa|up_6|NZ_AP022584.1_5397285_5397552_-,NA|301aa|up_3|NZ_AP022584.1_5401823_5402726_+,NA|125aa|up_1|NZ_AP022584.1_5404102_5404477_-,NA	NA|158aa|up_9|NZ_AP022584.1_5394646_5395120_-	COG1225, Bcp, Peroxiredoxin [Posttranslational modification, protein turnover, chaperones]	NA|76aa|up_8|NZ_AP022584.1_5395224_5395452_+	pfam12277, DUF3618, Protein of unknown function (DUF3618)	NA|580aa|up_7|NZ_AP022584.1_5395438_5397178_-	pfam12077, DUF3556, Transmembrane protein of unknown function (DUF3556)	NA|89aa|up_6|NZ_AP022584.1_5397285_5397552_-	NA	NA|413aa|up_5|NZ_AP022584.1_5397948_5399187_-	PRK01346, PRK01346, enhanced intracellular survival protein Eis	NA|789aa|up_4|NZ_AP022584.1_5399382_5401749_+	COG2409, COG2409, Predicted drug exporters of the RND superfamily [General function prediction only]	NA|301aa|up_3|NZ_AP022584.1_5401823_5402726_+	NA	NA|399aa|up_2|NZ_AP022584.1_5402896_5404093_+	pfam17964, Big_10, Bacterial Ig domain	NA|125aa|up_1|NZ_AP022584.1_5404102_5404477_-	NA	NA|216aa|up_0|NZ_AP022584.1_5404717_5405365_-	PRK05359, PRK05359, oligoribonuclease; Provisional	NA|522aa|down_0|NZ_AP022584.1_5405565_5407131_+	pfam05872, DUF853, Bacterial protein of unknown function (DUF853)	NA|269aa|down_1|NZ_AP022584.1_5407243_5408050_-	COG0300, DltE, Short-chain dehydrogenases of various substrate specificities [General function prediction only]	NA|451aa|down_2|NZ_AP022584.1_5408119_5409472_+	COG2270, COG2270, Permeases of the major facilitator superfamily [General function prediction only]	NA|258aa|down_3|NZ_AP022584.1_5409497_5410271_-	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|219aa|down_4|NZ_AP022584.1_5410340_5410997_-	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|530aa|down_5|NZ_AP022584.1_5411192_5412782_+	PLN02820, PLN02820, 3-methylcrotonyl-CoA carboxylase, beta chain	NA|666aa|down_6|NZ_AP022584.1_5412787_5414785_+	COG4770, COG4770, Acetyl/propionyl-CoA carboxylase, alpha subunit [Lipid metabolism]	NA|389aa|down_7|NZ_AP022584.1_5414781_5415948_+	cd01158, SCAD_SBCAD, Short chain acyl-CoA dehydrogenases and eukaryotic short/branched chain acyl-CoA dehydrogenases	NA|170aa|down_8|NZ_AP022584.1_5415944_5416454_+	cd03451, FkbR2, FkbR2 is a Streptomyces hygroscopicus protein with a hot dog fold that belongs to a conserved family of proteins found in prokaryotes and archaea but not in eukaryotes	NA|274aa|down_9|NZ_AP022584.1_5416450_5417272_+	COG2301, CitE, Citrate lyase beta subunit [Carbohydrate transport and metabolism]
