assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_010731735.1_ASM1073173v1	NZ_AP022588	Mycolicibacterium sediminis strain JCM 17899	1	511125-511256	1	CRISPRCasFinder	no	DEDDh	cas3,csa3,DEDDh,casR,WYL,PD-DExK,cas4,DinG	Unclear	GAAAGTGCGAGTGGACGCCTACGAAAAGCGACTCTCGTCG	40	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,DEDDh,casR,WYL,PD-DExK,cas4,DinG	NA|102aa|up_8|NZ_AP022588.1_501963_502269_-,NA|133aa|down_0|NZ_AP022588.1_511299_511698_-	NA|322aa|up_9|NZ_AP022588.1_500990_501956_-	cd09019, galactose_mutarotase_like, galactose mutarotase_like	NA|102aa|up_8|NZ_AP022588.1_501963_502269_-	NA	NA|569aa|up_7|NZ_AP022588.1_502284_503991_-	cd11478, SLC5sbd_u2, Uncharacterized bacterial solute carrier 5 subfamily; putative solute-binding domain	NA|262aa|up_6|NZ_AP022588.1_504129_504915_+	COG1349, GlpR, Transcriptional regulators of sugar metabolism [Transcription / Carbohydrate transport and metabolism]	NA|366aa|up_5|NZ_AP022588.1_504911_506009_+	cd00608, GalT, Galactose-1-phosphate uridyl transferase (GalT): This enzyme plays a key role in galactose metabolism by catalysing the transfer of a uridine 5'-phosphoryl group from UDP-galactose 1-phosphate	NA|362aa|up_4|NZ_AP022588.1_506005_507091_+	PRK00555, PRK00555, galactokinase; Provisional	DEDDh|330aa|up_3|NZ_AP022588.1_507077_508067_-	PRK06063, PRK06063, DEDDh family exonuclease	NA|422aa|up_2|NZ_AP022588.1_508168_509434_+	COG0769, MurE, UDP-N-acetylmuramyl tripeptide synthase [Cell envelope biogenesis, outer membrane]	NA|234aa|up_1|NZ_AP022588.1_509484_510186_+	COG3442, COG3442, Predicted glutamine amidotransferase [General function prediction only]	NA|292aa|up_0|NZ_AP022588.1_510194_511070_-	pfam18741, MTES_1575, REase_MTES_1575	NA|133aa|down_0|NZ_AP022588.1_511299_511698_-	NA	NA|204aa|down_1|NZ_AP022588.1_511708_512320_-	PRK00076, recR, recombination protein RecR; Reviewed	NA|111aa|down_2|NZ_AP022588.1_512334_512667_-	PRK00153, PRK00153, YbaB/EbfC family nucleoid-associated protein	NA|273aa|down_3|NZ_AP022588.1_512823_513642_+	COG0860, AmiC, N-acetylmuramoyl-L-alanine amidase [Cell envelope biogenesis, outer membrane]	NA|147aa|down_4|NZ_AP022588.1_513638_514079_-	pfam10604, Polyketide_cyc2, Polyketide cyclase / dehydrase and lipid transport	NA|460aa|down_5|NZ_AP022588.1_514169_515549_+	COG0277, GlcD, FAD/FMN-containing dehydrogenases [Energy production and conversion]	NA|437aa|down_6|NZ_AP022588.1_515577_516888_+	pfam02353, CMAS, Mycolic acid cyclopropane synthetase	NA|674aa|down_7|NZ_AP022588.1_516884_518906_-	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|429aa|down_8|NZ_AP022588.1_518954_520241_-	pfam12897, Aminotran_MocR, Alanine-glyoxylate amino-transferase	NA|277aa|down_9|NZ_AP022588.1_520683_521514_+	COG5593, COG5593, Nucleic-acid-binding protein possibly involved in ribosomal biogenesis [Translation, ribosomal structure and biogenesis]
GCF_010731735.1_ASM1073173v1	NZ_AP022588	Mycolicibacterium sediminis strain JCM 17899	2	3748489-3748580	2	CRISPRCasFinder	no		cas3,csa3,DEDDh,casR,WYL,PD-DExK,cas4,DinG	Orphan	CCGTCGGGGTTGGAGCCGCCGTC	23	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,DEDDh,casR,WYL,PD-DExK,cas4,DinG	NA|132aa|up_7|NZ_AP022588.1_3740880_3741276_-,NA|128aa|up_6|NZ_AP022588.1_3741638_3742022_-,NA|77aa|up_4|NZ_AP022588.1_3744155_3744386_-,NA	NA|103aa|up_9|NZ_AP022588.1_3739818_3740127_+	pfam05305, DUF732, Protein of unknown function (DUF732)	NA|68aa|up_8|NZ_AP022588.1_3740235_3740439_+	PRK05387, PRK05387, histidinol-phosphate aminotransferase; Provisional	NA|132aa|up_7|NZ_AP022588.1_3740880_3741276_-	NA	NA|128aa|up_6|NZ_AP022588.1_3741638_3742022_-	NA	NA|662aa|up_5|NZ_AP022588.1_3742067_3744053_-	pfam00582, Usp, Universal stress protein family	NA|77aa|up_4|NZ_AP022588.1_3744155_3744386_-	NA	NA|243aa|up_3|NZ_AP022588.1_3744426_3745155_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|395aa|up_2|NZ_AP022588.1_3745094_3746279_+	PRK07538, PRK07538, hypothetical protein; Provisional	NA|294aa|up_1|NZ_AP022588.1_3746283_3747165_-	COG4759, COG4759, Uncharacterized protein conserved in bacteria containing thioredoxin-like domain [Posttranslational modification, protein turnover, chaperones]	NA|406aa|up_0|NZ_AP022588.1_3747161_3748379_-	pfam08007, Cupin_4, Cupin superfamily protein	NA|522aa|down_0|NZ_AP022588.1_3748916_3750482_+	cd09143, PLDc_vPLD1_2_like_bac_2, Catalytic domain, repeat 2, of uncharacterized bacterial proteins with similarity to vertebrate phospholipases, PLD1 and PLD2	NA|629aa|down_1|NZ_AP022588.1_3750483_3752370_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|344aa|down_2|NZ_AP022588.1_3752463_3753495_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|403aa|down_3|NZ_AP022588.1_3753599_3754808_+	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|108aa|down_4|NZ_AP022588.1_3754824_3755148_-	COG1359, COG1359, Uncharacterized conserved protein [Function unknown]	NA|342aa|down_5|NZ_AP022588.1_3755156_3756182_-	cd05284, arabinose_DH_like, D-arabinose dehydrogenase	NA|228aa|down_6|NZ_AP022588.1_3756187_3756871_-	pfam01883, FeS_assembly_P, Iron-sulfur cluster assembly protein	NA|346aa|down_7|NZ_AP022588.1_3756912_3757950_-	pfam04909, Amidohydro_2, Amidohydrolase	NA|486aa|down_8|NZ_AP022588.1_3758145_3759603_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|319aa|down_9|NZ_AP022588.1_3759639_3760596_+	TIGR03557, F420_G6P_family, F420-dependent oxidoreductase, G6PDH family
GCF_010731735.1_ASM1073173v1	NZ_AP022588	Mycolicibacterium sediminis strain JCM 17899	3	5136909-5136994	3	CRISPRCasFinder	no		cas3,csa3,DEDDh,casR,WYL,PD-DExK,cas4,DinG	Orphan	CCCACGTTGGTGTTGTTGGACCC	23	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,DEDDh,casR,WYL,PD-DExK,cas4,DinG	NA,NA	NA|283aa|up_9|NZ_AP022588.1_5122656_5123505_+	COG1082, IolE, Sugar phosphate isomerases/epimerases [Carbohydrate transport and metabolism]	NA|725aa|up_8|NZ_AP022588.1_5123622_5125797_-	COG3968, COG3968, Uncharacterized protein related to glutamine synthetase [General function prediction only]	NA|365aa|up_7|NZ_AP022588.1_5125923_5127018_+	cd05819, NHL, NHL repeat unit of beta-propeller proteins	NA|776aa|up_6|NZ_AP022588.1_5127045_5129373_+	TIGR01701, Hypothetical_protein_Rv2900c/MT2968/Mb2924c	NA|275aa|up_5|NZ_AP022588.1_5129365_5130190_+	PRK00724, PRK00724, formate dehydrogenase accessory sulfurtransferase FdhD	NA|593aa|up_4|NZ_AP022588.1_5130208_5131987_+	pfam13515, FUSC_2, Fusaric acid resistance protein-like	NA|427aa|up_3|NZ_AP022588.1_5132035_5133316_-	PRK05342, clpX, ATP-dependent Clp protease ATP-binding subunit ClpX	NA|216aa|up_2|NZ_AP022588.1_5133587_5134235_-	PRK12553, PRK12553, ATP-dependent Clp protease proteolytic subunit; Reviewed	NA|195aa|up_1|NZ_AP022588.1_5134231_5134816_-	PRK00277, clpP, ATP-dependent Clp protease proteolytic subunit; Reviewed	NA|477aa|up_0|NZ_AP022588.1_5134954_5136385_-	PRK01490, tig, trigger factor; Provisional	NA|497aa|down_0|NZ_AP022588.1_5138322_5139813_+	cd07114, ALDH_DhaS, Uncharacterized Candidatus pelagibacter aldehyde dehydrogenase, DhaS-like	NA|460aa|down_1|NZ_AP022588.1_5139853_5141233_+	cd17371, MFS_MucK, Cis,cis-muconate transport protein and similar proteins of the Major Facilitator Superfamily	NA|484aa|down_2|NZ_AP022588.1_5142244_5143696_+	PRK13875, PRK13875, conjugal transfer protein TrbL; Provisional	NA|211aa|down_3|NZ_AP022588.1_5143916_5144549_+	pfam01083, Cutinase, Cutinase	NA|278aa|down_4|NZ_AP022588.1_5144635_5145469_+	PLN03138, PLN03138, Protein TOC75; Provisional	NA|564aa|down_5|NZ_AP022588.1_5145479_5147171_-	COG0025, NhaP, NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism]	NA|574aa|down_6|NZ_AP022588.1_5147404_5149126_+	pfam13304, AAA_21, AAA domain, putative AbiEii toxin, Type IV TA system	NA|265aa|down_7|NZ_AP022588.1_5149143_5149938_-	COG0266, Nei, Formamidopyrimidine-DNA glycosylase [DNA replication, recombination, and repair]	NA|163aa|down_8|NZ_AP022588.1_5149941_5150430_-	PRK05571, PRK05571, ribose-5-phosphate isomerase B; Provisional	NA|563aa|down_9|NZ_AP022588.1_5150491_5152180_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins
GCF_010731735.1_ASM1073173v1	NZ_AP022588	Mycolicibacterium sediminis strain JCM 17899	4	5370837-5370950	4	CRISPRCasFinder	no	DinG	cas3,csa3,DEDDh,casR,WYL,PD-DExK,cas4,DinG	Type IV-A	CGCGTGTCGCGTCGCCAGATGCACACTCGAG	31	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,DEDDh,casR,WYL,PD-DExK,cas4,DinG	NA|418aa|up_9|NZ_AP022588.1_5357826_5359080_+,NA|324aa|down_3|NZ_AP022588.1_5372652_5373624_-	NA|418aa|up_9|NZ_AP022588.1_5357826_5359080_+	NA	NA|279aa|up_8|NZ_AP022588.1_5359081_5359918_+	PRK06180, PRK06180, short chain dehydrogenase; Provisional	NA|274aa|up_7|NZ_AP022588.1_5359914_5360736_+	TIGR03971, short-chain_dehydrogenase/reductase_SDR, SDR family mycofactocin-dependent oxidoreductase	NA|156aa|up_6|NZ_AP022588.1_5360709_5361177_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|419aa|up_5|NZ_AP022588.1_5361249_5362506_+	cd17320, MFS_MdfA_MDR_like, Multidrug transporter MdfA and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|594aa|up_4|NZ_AP022588.1_5362586_5364368_+	COG0768, FtsI, Cell division protein FtsI/penicillin-binding protein 2 [Cell envelope biogenesis, outer membrane]	NA|789aa|up_3|NZ_AP022588.1_5364417_5366784_+	pfam03706, LPG_synthase_TM, Lysylphosphatidylglycerol synthase TM region	NA|149aa|up_2|NZ_AP022588.1_5366839_5367286_+	cd07821, PYR_PYL_RCAR_like, Pyrabactin resistance 1 (PYR1), PYR1-like (PYL), regulatory component of abscisic acid receptors (RCARs), and related proteins	NA|607aa|up_1|NZ_AP022588.1_5367278_5369099_+	cd16035, sulfatase_like, uncharacterized sulfatase subfamily	NA|546aa|up_0|NZ_AP022588.1_5369158_5370796_+	COG4108, PrfC, Peptide chain release factor RF-3 [Translation, ribosomal structure and biogenesis]	NA|259aa|down_0|NZ_AP022588.1_5371140_5371917_+	pfam14528, LAGLIDADG_3, LAGLIDADG-like domain	NA|127aa|down_1|NZ_AP022588.1_5371897_5372278_+	COG4578, GutM, Glucitol operon activator [Transcription]	NA|124aa|down_2|NZ_AP022588.1_5372274_5372646_+	pfam12823, DUF3817, Domain of unknown function (DUF3817)	NA|324aa|down_3|NZ_AP022588.1_5372652_5373624_-	NA	NA|618aa|down_4|NZ_AP022588.1_5373677_5375531_-	cd17504, MFS_MMR_MDR_like, Methylenomycin A resistance protein (also called MMR peptide)-like multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|200aa|down_5|NZ_AP022588.1_5375610_5376210_-	PRK00120, PRK00120, dITP/XTP pyrophosphatase; Reviewed	NA|261aa|down_6|NZ_AP022588.1_5376206_5376989_-	PRK00173, rph, ribonuclease PH; Reviewed	NA|258aa|down_7|NZ_AP022588.1_5377045_5377819_-	cd07716, RNaseZ_short-form-like_MBL-fold, uncharacterized bacterial subgroup of Ribonuclease Z, short form; MBL-fold metallo-hydrolase domain	NA|245aa|down_8|NZ_AP022588.1_5377993_5378728_-	pfam01694, Rhomboid, Rhomboid family	NA|324aa|down_9|NZ_AP022588.1_5378690_5379662_-	TIGR01136, Cysteine_synthase, cysteine synthase
GCF_010731735.1_ASM1073173v1	NZ_AP022588	Mycolicibacterium sediminis strain JCM 17899	5	6234646-6234777	5	CRISPRCasFinder	no		cas3,csa3,DEDDh,casR,WYL,PD-DExK,cas4,DinG	Orphan	CCCGAGCTGCCCCCGCCGCCGCCCGAGGCCGCCGCA	36	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,DEDDh,casR,WYL,PD-DExK,cas4,DinG	NA,NA|68aa|down_3|NZ_AP022588.1_6236954_6237158_-,NA|104aa|down_4|NZ_AP022588.1_6245301_6245613_-,NA|64aa|down_5|NZ_AP022588.1_6245747_6245939_+	NA|152aa|up_9|NZ_AP022588.1_6225918_6226374_+	cd07812, SRPBCC, START/RHO_alpha_C/PITP/Bet_v1/CoxG/CalC (SRPBCC) ligand-binding domain superfamily	NA|265aa|up_8|NZ_AP022588.1_6226378_6227173_-	pfam11228, DUF3027, Protein of unknown function (DUF3027)	NA|523aa|up_7|NZ_AP022588.1_6227367_6228936_+	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|164aa|up_6|NZ_AP022588.1_6228932_6229424_+	pfam10969, DUF2771, Protein of unknown function (DUF2771)	NA|335aa|up_5|NZ_AP022588.1_6229427_6230432_-	COG0435, ECM4, Predicted glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|293aa|up_4|NZ_AP022588.1_6230550_6231429_+	pfam18741, MTES_1575, REase_MTES_1575	NA|137aa|up_3|NZ_AP022588.1_6231471_6231882_-	COG1278, CspC, Cold shock proteins [Transcription]	NA|135aa|up_2|NZ_AP022588.1_6232035_6232440_+	PRK11770, PRK11770, YccF domain-containing protein	NA|351aa|up_1|NZ_AP022588.1_6232436_6233489_+	PRK00164, moaA, GTP 3',8-cyclase MoaA	NA|83aa|up_0|NZ_AP022588.1_6233494_6233743_+	cd00754, Ubl_MoaD, ubiquitin-like (Ubl) domain found in molybdenum cofactor biosynthesis protein D (MoaD) and similar proteins	NA|142aa|down_0|NZ_AP022588.1_6235542_6235968_-	cd00756, MoaE, MoaE family	NA|164aa|down_1|NZ_AP022588.1_6235993_6236485_-	cd00886, MogA_MoaB, MogA_MoaB family	NA|142aa|down_2|NZ_AP022588.1_6236481_6236907_-	PRK09364, moaC, cyclic pyranopterin monophosphate synthase MoaC	NA|68aa|down_3|NZ_AP022588.1_6236954_6237158_-	NA	NA|104aa|down_4|NZ_AP022588.1_6245301_6245613_-	NA	NA|64aa|down_5|NZ_AP022588.1_6245747_6245939_+	NA	NA|204aa|down_6|NZ_AP022588.1_6245985_6246597_+	pfam10698, DUF2505, Protein of unknown function (DUF2505)	NA|NA	NA	NA|NA	NA	NA|NA	NA
