assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000328565.1_ASM32856v1	NC_019966	Mycobacterium sp. JS623, complete genome	1	570915-570979	1	CRISPRCasFinder	no		csa3,WYL,cas3,c2c9_V-U4,RT,cas4,DEDDh,Cas14u_CAS-V,DinG	Orphan	AAGAAGGCTCCGGCCAAGAAGGC	23	1	1	570938-570956	NC_019966.1_5952664-5952646	NA	1	1	Orphan	csa3,WYL,cas3,c2c9_V-U4,RT,cas4,DEDDh,Cas14u_CAS-V,DinG,cas6e,cas5,cas7,cse2gr11,cas8e,csf3gr5,csf2gr7,csf4gr11,csf1gr8	NA,NA|299aa|down_5|NC_019966.1_574206_575103_-	NA|194aa|up_9|NC_019966.1_560357_560939_-	COG2128, COG2128, Uncharacterized conserved protein [Function unknown]	NA|476aa|up_8|NC_019966.1_560935_562363_-	COG3800, COG3800, Predicted transcriptional regulator [General function prediction only]	NA|273aa|up_7|NC_019966.1_562469_563288_+	COG3884, FatA, Acyl-ACP thioesterase [Lipid metabolism]	NA|429aa|up_6|NC_019966.1_563551_564838_+	PRK15063, PRK15063, isocitrate lyase; Provisional	NA|287aa|up_5|NC_019966.1_565026_565887_+	PRK07819, PRK07819, 3-hydroxybutyryl-CoA dehydrogenase; Validated	NA|282aa|up_4|NC_019966.1_565874_566720_-	pfam18741, MTES_1575, REase_MTES_1575	NA|290aa|up_3|NC_019966.1_566816_567686_-	TIGR03709, PPK2_rel_1, polyphosphate:nucleotide phosphotransferase, PPK2 family	NA|237aa|up_2|NC_019966.1_567678_568389_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|444aa|up_1|NC_019966.1_568491_569823_+	COG2733, COG2733, Predicted membrane protein [Function unknown]	NA|153aa|up_0|NC_019966.1_569932_570391_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|94aa|down_0|NC_019966.1_571093_571375_+	pfam10724, DUF2516, Protein of unknown function (DUF2516)	NA|146aa|down_1|NC_019966.1_571377_571815_+	pfam10783, DUF2599, Protein of unknown function (DUF2599)	NA|226aa|down_2|NC_019966.1_571815_572493_+	PRK00507, PRK00507, deoxyribose-phosphate aldolase; Provisional	NA|297aa|down_3|NC_019966.1_572489_573380_-	pfam11209, DUF2993, Protein of unknown function (DUF2993)	NA|275aa|down_4|NC_019966.1_573376_574201_-	cd07581, nitrilase_3, Uncharacterized subgroup of the nitrilase superfamily (putative class 13 nitrilases)	NA|299aa|down_5|NC_019966.1_574206_575103_-	NA	NA|172aa|down_6|NC_019966.1_575077_575593_-	pfam10698, DUF2505, Protein of unknown function (DUF2505)	NA|170aa|down_7|NC_019966.1_575615_576125_-	pfam10698, DUF2505, Protein of unknown function (DUF2505)	NA|349aa|down_8|NC_019966.1_576151_577198_+	PRK13903, murB, UDP-N-acetylmuramate dehydrogenase	NA|430aa|down_9|NC_019966.1_577213_578503_+	pfam17964, Big_10, Bacterial Ig domain
GCF_000328565.1_ASM32856v1	NC_019966	Mycobacterium sp. JS623, complete genome	2	4132892-4132977	2	CRISPRCasFinder	no	csa3	csa3,WYL,cas3,c2c9_V-U4,RT,cas4,DEDDh,Cas14u_CAS-V,DinG	Type I-A	CGCTGCCGAGGATGCTGCCCAGC	23	0	0	NA	NA	NA	1	1	Orphan	csa3,WYL,cas3,c2c9_V-U4,RT,cas4,DEDDh,Cas14u_CAS-V,DinG,cas6e,cas5,cas7,cse2gr11,cas8e,csf3gr5,csf2gr7,csf4gr11,csf1gr8	NA,NA	NA|279aa|up_9|NC_019966.1_4120373_4121210_+	cd01409, SIRT4, SIRT4: Eukaryotic and prokaryotic group (class2) which includes human sirtuin SIRT4 and several bacterial homologs; and are members of the SIR2 family of proteins, silent information regulator 2 (Sir2) enzymes which catalyze NAD+-dependent protein/histone deacetylation	NA|367aa|up_8|NC_019966.1_4121248_4122349_-	PRK05429, PRK05429, gamma-glutamyl kinase; Provisional	NA|480aa|up_7|NC_019966.1_4122345_4123785_-	PRK12296, obgE, GTPase CgtA; Reviewed	NA|89aa|up_6|NC_019966.1_4123873_4124140_-	PRK05435, rpmA, 50S ribosomal protein L27; Validated	NA|104aa|up_5|NC_019966.1_4124151_4124463_-	PRK05573, rplU, 50S ribosomal protein L21; Validated	NA|937aa|up_4|NC_019966.1_4124624_4127435_-	TIGR00757, Ribonuclease_E/G-like_protein, ribonuclease, Rne/Rng family	NA|137aa|up_3|NC_019966.1_4127741_4128152_-	PRK00668, ndk, mulitfunctional nucleoside diphosphate kinase/apyrimidinic endonuclease/3'-; Validated	NA|130aa|up_2|NC_019966.1_4128253_4128643_-	pfam14017, DUF4233, Protein of unknown function (DUF4233)	NA|478aa|up_1|NC_019966.1_4128639_4130073_-	COG0285, FolC, Folylpolyglutamate synthase [Coenzyme metabolism]	NA|884aa|up_0|NC_019966.1_4130069_4132721_-	PRK05729, valS, valyl-tRNA synthetase; Reviewed	NA|429aa|down_0|NC_019966.1_4133494_4134781_-	COG3268, COG3268, Uncharacterized conserved protein [Function unknown]	NA|657aa|down_1|NC_019966.1_4134791_4136762_-	COG1506, DAP2, Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]	NA|135aa|down_2|NC_019966.1_4136859_4137264_+	COG1733, COG1733, Predicted transcriptional regulators [Transcription]	NA|177aa|down_3|NC_019966.1_4137260_4137791_+	cd03443, PaaI_thioesterase, PaaI_thioesterase is a tetrameric acyl-CoA thioesterase with a hot dog fold and one of several proteins responsible for phenylacetic acid (PA) degradation in bacteria	NA|169aa|down_4|NC_019966.1_4137810_4138317_+	PRK06847, PRK06847, hypothetical protein; Provisional	NA|196aa|down_5|NC_019966.1_4138390_4138978_+	pfam18029, Glyoxalase_6, Glyoxalase-like domain	NA|162aa|down_6|NC_019966.1_4138949_4139435_-	pfam06737, Transglycosylas, Transglycosylase-like domain	NA|109aa|down_7|NC_019966.1_4139647_4139974_-	pfam06737, Transglycosylas, Transglycosylase-like domain	NA|235aa|down_8|NC_019966.1_4141034_4141739_-	PRK00576, PRK00576, molybdopterin-guanine dinucleotide biosynthesis protein A; Provisional	NA|359aa|down_9|NC_019966.1_4141635_4142712_-	PRK11867, PRK11867, 2-oxoglutarate ferredoxin oxidoreductase subunit beta; Reviewed
GCF_000328565.1_ASM32856v1	NC_019966	Mycobacterium sp. JS623, complete genome	3	4713242-4713360	3	CRISPRCasFinder	no		csa3,WYL,cas3,c2c9_V-U4,RT,cas4,DEDDh,Cas14u_CAS-V,DinG	Orphan	CGAGCGAGGACCGGAGCGAGCGGGAGTCGAGGCATGAGCACC	42	0	0	NA	NA	NA	1	1	Orphan	csa3,WYL,cas3,c2c9_V-U4,RT,cas4,DEDDh,Cas14u_CAS-V,DinG,cas6e,cas5,cas7,cse2gr11,cas8e,csf3gr5,csf2gr7,csf4gr11,csf1gr8	NA,NA|178aa|down_0|NC_019966.1_4713384_4713918_+,NA|77aa|down_1|NC_019966.1_4713925_4714156_-	NA|615aa|up_9|NC_019966.1_4700509_4702354_-	PRK05506, PRK05506, bifunctional sulfate adenylyltransferase subunit 1/adenylylsulfate kinase protein; Provisional	NA|308aa|up_8|NC_019966.1_4702353_4703277_-	PRK05253, PRK05253, sulfate adenylyltransferase subunit CysD	NA|270aa|up_7|NC_019966.1_4703446_4704256_-	cd01069, PBP2_PheC, Cyclohexadienyl dehydratase, a member of the type 2 periplasmic binding fold protein superfamily	NA|142aa|up_6|NC_019966.1_4704314_4704740_+	cd03443, PaaI_thioesterase, PaaI_thioesterase is a tetrameric acyl-CoA thioesterase with a hot dog fold and one of several proteins responsible for phenylacetic acid (PA) degradation in bacteria	NA|164aa|up_5|NC_019966.1_4704855_4705347_-	cd03379, beta_CA_cladeD, Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|635aa|up_4|NC_019966.1_4705479_4707384_-	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|420aa|up_3|NC_019966.1_4707415_4708675_-	cd04188, DPG_synthase, DPG_synthase is involved in protein N-linked glycosylation	NA|633aa|up_2|NC_019966.1_4708727_4710626_-	pfam13231, PMT_2, Dolichyl-phosphate-mannose-protein mannosyltransferase	NA|560aa|up_1|NC_019966.1_4710649_4712329_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|235aa|up_0|NC_019966.1_4712306_4713011_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|178aa|down_0|NC_019966.1_4713384_4713918_+	NA	NA|77aa|down_1|NC_019966.1_4713925_4714156_-	NA	NA|326aa|down_2|NC_019966.1_4714761_4715739_+	COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|304aa|down_3|NC_019966.1_4715747_4716659_+	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|608aa|down_4|NC_019966.1_4716655_4718479_+	PRK10261, PRK10261, glutathione transporter ATP-binding protein; Provisional	NA|557aa|down_5|NC_019966.1_4718536_4720207_+	cd08501, PBP2_Lpqw, The substrate-binding domain of mycobacterial lipoprotein Lpqw contains type 2 periplasmic binding fold	NA|429aa|down_6|NC_019966.1_4720193_4721480_-	pfam00561, Abhydrolase_1, alpha/beta hydrolase fold	NA|406aa|down_7|NC_019966.1_4721568_4722786_+	cd01163, DszC, Dibenzothiophene (DBT) desulfurization enzyme C	NA|308aa|down_8|NC_019966.1_4722841_4723765_-	cd13606, PBP2_ProX_like, Bacterial substrate-binding protein ProX of ABC-type osmoregulated transporter and its related proteins; the type 2 periplasmic-binding protein fold	NA|237aa|down_9|NC_019966.1_4723775_4724486_-	COG1174, OpuBB, ABC-type proline/glycine betaine transport systems, permease component [Amino acid transport and metabolism]
GCF_000328565.1_ASM32856v1	NC_019958	Mycobacterium sp. JS623 plasmid pMYCSM02, complete sequence	1	94617-95134	1,1	CRISPRCasFinder,CRT	no	cas6e,cas5,cas7,cse2gr11,cas8e,cas3	DEDDh,cas6e,cas5,cas7,cse2gr11,cas8e,cas3,csf3gr5,csf2gr7,csf4gr11,csf1gr8	Type I-E	GGGCTCATCCCCGCAGGCGCGGGGAA,GGGCTCATCCCCGCAGGCGCGGGGAAGAC	26,29	0	0	NA	NA	I-E:I-E	8,8	8	TypeI-E	csa3,WYL,cas3,c2c9_V-U4,RT,cas4,DEDDh,Cas14u_CAS-V,DinG,cas6e,cas5,cas7,cse2gr11,cas8e,csf3gr5,csf2gr7,csf4gr11,csf1gr8	NA|74aa|up_9|NC_019958.1_87435_87657_+,NA|237aa|up_7|NC_019958.1_88708_89419_+,NA|126aa|up_6|NC_019958.1_89498_89876_-,NA|200aa|up_4|NC_019958.1_90612_91212_-,NA|139aa|up_3|NC_019958.1_91547_91964_+,NA|195aa|up_2|NC_019958.1_92354_92939_+,NA|119aa|down_5|NC_019958.1_102881_103238_-,NA|88aa|down_6|NC_019958.1_103411_103675_+,NA|157aa|down_7|NC_019958.1_104106_104577_-,NA|244aa|down_8|NC_019958.1_104652_105384_-,NA|216aa|down_9|NC_019958.1_107409_108057_+	NA|74aa|up_9|NC_019958.1_87435_87657_+	NA	NA|299aa|up_8|NC_019958.1_87660_88557_+	cd18715, PIN_VapC-like, uncharacterized subfamily of the VapC (virulence-associated protein C)-like family of the PIN domain superfamily	NA|237aa|up_7|NC_019958.1_88708_89419_+	NA	NA|126aa|up_6|NC_019958.1_89498_89876_-	NA	NA|221aa|up_5|NC_019958.1_89965_90628_-	pfam08808, RES, RES domain	NA|200aa|up_4|NC_019958.1_90612_91212_-	NA	NA|139aa|up_3|NC_019958.1_91547_91964_+	NA	NA|195aa|up_2|NC_019958.1_92354_92939_+	NA	NA|215aa|up_1|NC_019958.1_92935_93580_+	smart00953, RES, RES domain	NA|88aa|up_0|NC_019958.1_94079_94343_-	pfam07704, PSK_trans_fac, Rv0623-like transcription factor	cas5|228aa|down_0|NC_019958.1_95849_96533_-	cd09756, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas7|377aa|down_1|NC_019958.1_96529_97660_-	pfam09344, Cas_CT1975, CT1975-like protein	cse2gr11|217aa|down_2|NC_019958.1_97687_98338_-	pfam09485, CRISPR_Cse2, CRISPR-associated protein Cse2 (CRISPR_cse2)	cas8e|547aa|down_3|NC_019958.1_98334_99975_-	pfam09481, CRISPR_Cse1, CRISPR-associated protein Cse1 (CRISPR_cse1)	cas3|915aa|down_4|NC_019958.1_99971_102716_-	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	NA|119aa|down_5|NC_019958.1_102881_103238_-	NA	NA|88aa|down_6|NC_019958.1_103411_103675_+	NA	NA|157aa|down_7|NC_019958.1_104106_104577_-	NA	NA|244aa|down_8|NC_019958.1_104652_105384_-	NA	NA|216aa|down_9|NC_019958.1_107409_108057_+	NA
GCF_000328565.1_ASM32856v1	NC_019959	Mycobacterium sp. JS623 plasmid pMYCSM03, complete sequence	1	39563-39752	1	CRISPRCasFinder	no	c2c9_V-U4	DEDDh,c2c9_V-U4,csf3gr5,csf2gr7,csf4gr11,csf1gr8	Type V-U4	GACTCACCCGTGTGCGCGTGGGGCGCAC	28	0	0	NA	NA	NA	2	2	TypeV-U4	csa3,WYL,cas3,c2c9_V-U4,RT,cas4,DEDDh,Cas14u_CAS-V,DinG,cas6e,cas5,cas7,cse2gr11,cas8e,csf3gr5,csf2gr7,csf4gr11,csf1gr8	NA|401aa|up_9|NC_019959.1_31472_32675_+,NA|95aa|up_8|NC_019959.1_32699_32984_-,NA|77aa|up_7|NC_019959.1_33060_33291_+,NA|88aa|up_6|NC_019959.1_33291_33555_+,NA|125aa|up_5|NC_019959.1_33796_34171_-,NA|195aa|up_4|NC_019959.1_34176_34761_-,NA|120aa|up_1|NC_019959.1_38587_38947_-,NA|143aa|up_0|NC_019959.1_39109_39538_+,NA|118aa|down_2|NC_019959.1_42565_42919_-,NA|142aa|down_4|NC_019959.1_43770_44196_-,NA|159aa|down_5|NC_019959.1_44209_44686_-,NA|68aa|down_7|NC_019959.1_45102_45306_+,NA|254aa|down_9|NC_019959.1_45730_46492_-	NA|401aa|up_9|NC_019959.1_31472_32675_+	NA	NA|95aa|up_8|NC_019959.1_32699_32984_-	NA	NA|77aa|up_7|NC_019959.1_33060_33291_+	NA	NA|88aa|up_6|NC_019959.1_33291_33555_+	NA	NA|125aa|up_5|NC_019959.1_33796_34171_-	NA	NA|195aa|up_4|NC_019959.1_34176_34761_-	NA	NA|790aa|up_3|NC_019959.1_34773_37143_-	PRK07003, PRK07003, DNA polymerase III subunit gamma/tau	NA|203aa|up_2|NC_019959.1_37943_38552_-	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|120aa|up_1|NC_019959.1_38587_38947_-	NA	NA|143aa|up_0|NC_019959.1_39109_39538_+	NA	c2c9_V-U4|151aa|down_0|NC_019959.1_39759_40212_-	pfam07282, OrfB_Zn_ribbon, Putative transposase DNA-binding domain	NA|86aa|down_1|NC_019959.1_41505_41763_+	pfam02604, PhdYeFM_antitox, Antitoxin Phd_YefM, type II toxin-antitoxin system	NA|118aa|down_2|NC_019959.1_42565_42919_-	NA	NA|226aa|down_3|NC_019959.1_42976_43654_-	cd07500, HAD_PSP, phosphoserine phosphatase (PSP), similar to Methanococcus Jannaschii PSP and Saccharomyces cerevisiae SER2p	NA|142aa|down_4|NC_019959.1_43770_44196_-	NA	NA|159aa|down_5|NC_019959.1_44209_44686_-	NA	NA|84aa|down_6|NC_019959.1_44803_45055_-	COG4423, COG4423, Uncharacterized protein conserved in bacteria [Function unknown]	NA|68aa|down_7|NC_019959.1_45102_45306_+	NA	NA|127aa|down_8|NC_019959.1_45302_45683_+	COG3654, Doc, Prophage maintenance system killer protein [General function prediction only]	NA|254aa|down_9|NC_019959.1_45730_46492_-	NA
