assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000016365.1_ASM1636v1	NC_009338	Mycolicibacterium gilvum PYR-GCK, complete sequence	1	337343-337448	1	CRISPRCasFinder	no		csa3,DEDDh,cas3,DinG,cas4,casR,WYL,Cas9_archaeal	Orphan	CACGACGGCGGCGCCGACGGCACCGCCT	28	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh,cas3,DinG,cas4,casR,WYL,Cas9_archaeal,csf1gr8,csf4gr11,csf2gr7,csf3gr5,c2c9_V-U4,Cas14u_CAS-V	NA,NA|107aa|down_3|NC_009338.1_340864_341185_+,NA|87aa|down_9|NC_009338.1_347331_347592_+	NA|103aa|up_9|NC_009338.1_322661_322970_-	pfam00934, PE, PE family	NA|1329aa|up_8|NC_009338.1_322969_326956_-	TIGR03924, T7SS_EccC_a, type VII secretion protein EccCa	NA|513aa|up_7|NC_009338.1_326952_328491_-	pfam05108, T7SS_ESX1_EccB, Type VII secretion system ESX-1, transport TM domain B	NA|617aa|up_6|NC_009338.1_328487_330338_-	TIGR03922, T7SS_EccA, type VII secretion AAA-ATPase EccA	NA|302aa|up_5|NC_009338.1_330505_331411_-	COG3662, COG3662, Uncharacterized protein conserved in bacteria [Function unknown]	NA|210aa|up_4|NC_009338.1_331510_332140_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|192aa|up_3|NC_009338.1_332146_332722_-	pfam00903, Glyoxalase, Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily	NA|379aa|up_2|NC_009338.1_332830_333967_+	COG1647, COG1647, Esterase/lipase [General function prediction only]	NA|316aa|up_1|NC_009338.1_333963_334911_+	cd12175, 2-Hacid_dh_11, Putative D-isomer specific 2-hydroxyacid dehydrogenases, NAD-binding and catalytic domains	NA|729aa|up_0|NC_009338.1_334971_337158_+	cd01152, ACAD_fadE6_17_26, Putative acyl-CoA dehydrogenases similar to fadE6, fadE17, and fadE26	NA|403aa|down_0|NC_009338.1_337460_338669_+	pfam08007, Cupin_4, Cupin superfamily protein	NA|301aa|down_1|NC_009338.1_338665_339568_+	COG4759, COG4759, Uncharacterized protein conserved in bacteria containing thioredoxin-like domain [Posttranslational modification, protein turnover, chaperones]	NA|424aa|down_2|NC_009338.1_339564_340836_+	pfam01593, Amino_oxidase, Flavin containing amine oxidoreductase	NA|107aa|down_3|NC_009338.1_340864_341185_+	NA	NA|570aa|down_4|NC_009338.1_341263_342973_-	PRK07788, PRK07788, acyl-CoA synthetase; Validated	NA|413aa|down_5|NC_009338.1_343029_344268_+	cd04865, LigD_Pol_like_2, LigD_Pol_like_2: Polymerase (Pol) domain of bacterial LigD proteins similar to Pseudomonas aeruginosa (Pae) LigD, subgroup 2	NA|136aa|down_6|NC_009338.1_344425_344833_+	cd07043, STAS_anti-anti-sigma_factors, Sulphate Transporter and Anti-Sigma factor antagonist) domain of anti-anti-sigma factors, key regulators of anti-sigma factors by phosphorylation	NA|395aa|down_7|NC_009338.1_344974_346159_+	cd01152, ACAD_fadE6_17_26, Putative acyl-CoA dehydrogenases similar to fadE6, fadE17, and fadE26	NA|390aa|down_8|NC_009338.1_346151_347321_+	COG1960, CaiA, Acyl-CoA dehydrogenases [Lipid metabolism]	NA|87aa|down_9|NC_009338.1_347331_347592_+	NA
GCF_000016365.1_ASM1636v1	NC_009338	Mycolicibacterium gilvum PYR-GCK, complete sequence	2	4104312-4104400	2	CRISPRCasFinder	no		csa3,DEDDh,cas3,DinG,cas4,casR,WYL,Cas9_archaeal	Orphan	GGCGGTGCGGGCGGCGAGAACGTCTCGACCGA	32	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh,cas3,DinG,cas4,casR,WYL,Cas9_archaeal,csf1gr8,csf4gr11,csf2gr7,csf3gr5,c2c9_V-U4,Cas14u_CAS-V	NA,NA	NA|348aa|up_9|NC_009338.1_4092909_4093953_-	cd04685, Nudix_Hydrolase_26, Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|377aa|up_8|NC_009338.1_4093949_4095080_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|318aa|up_7|NC_009338.1_4095111_4096065_-	PRK07920, PRK07920, lipid A biosynthesis lauroyl acyltransferase; Provisional	NA|215aa|up_6|NC_009338.1_4096061_4096706_-	COG0558, PgsA, Phosphatidylglycerophosphate synthase [Lipid metabolism]	NA|183aa|up_5|NC_009338.1_4096720_4097269_-	cd01275, FHIT, FHIT (fragile histidine family): FHIT proteins, related to the HIT family carry a motif HxHxH/Qxx (x, is a hydrophobic amino acid), On the basis of sequence, substrate specificity, structure, evolution and mechanism, HIT proteins are classified into three  branches: the Hint branch, which consists of adenosine 5' -monophosphoramide hydrolases, the Fhit branch, that consists of diadenosine polyphosphate hydrolases, and the GalT branch consisting of specific nucloside monophosphate transferases	NA|696aa|up_4|NC_009338.1_4097270_4099358_-	PRK12305, thrS, threonyl-tRNA synthetase; Reviewed	NA|143aa|up_3|NC_009338.1_4099444_4099873_-	TIGR02611, Putative_membrane_protein, TIGR02611 family protein	NA|217aa|up_2|NC_009338.1_4099869_4100520_-	cd03443, PaaI_thioesterase, PaaI_thioesterase is a tetrameric acyl-CoA thioesterase with a hot dog fold and one of several proteins responsible for phenylacetic acid (PA) degradation in bacteria	NA|166aa|up_1|NC_009338.1_4100577_4101075_+	pfam09348, DUF1990, Domain of unknown function (DUF1990)	NA|239aa|up_0|NC_009338.1_4101273_4101990_+	pfam03861, ANTAR, ANTAR domain	NA|571aa|down_0|NC_009338.1_4106079_4107792_-	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|333aa|down_1|NC_009338.1_4107905_4108904_+	PRK07877, PRK07877, Rv1355c family protein	NA|275aa|down_2|NC_009338.1_4108897_4109722_+	pfam00582, Usp, Universal stress protein family	NA|217aa|down_3|NC_009338.1_4109753_4110404_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|330aa|down_4|NC_009338.1_4110422_4111412_-	PRK07877, PRK07877, Rv1355c family protein	NA|555aa|down_5|NC_009338.1_4111474_4113139_-	cd11332, AmyAc_OligoGlu_TS, Alpha amylase catalytic domain found in oligo-1,6-glucosidase (also called isomaltase; sucrase-isomaltase; alpha-limit dextrinase), trehalose synthase (also called maltose alpha-D-glucosyltransferase), and related proteins	NA|437aa|down_6|NC_009338.1_4113570_4114881_+	cd14750, PBP2_TMBP, The periplasmic-binding component of ABC transport systems specific for trehalose/maltose; possesses type 2 periplasmic binding fold	NA|311aa|down_7|NC_009338.1_4114877_4115810_+	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|276aa|down_8|NC_009338.1_4115806_4116634_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|410aa|down_9|NC_009338.1_4116638_4117868_+	PRK11650, ugpC, sn-glycerol-3-phosphate ABC transporter ATP-binding protein UgpC
GCF_000016365.1_ASM1636v1	NC_009338	Mycolicibacterium gilvum PYR-GCK, complete sequence	3	4160507-4160585	3	CRISPRCasFinder	no	Cas9_archaeal,csa3	csa3,DEDDh,cas3,DinG,cas4,casR,WYL,Cas9_archaeal	Unclear	GTAACGAACTGACGAACTGACTCA	24	0	0	NA	NA	NA	1	1	Unclear	csa3,DEDDh,cas3,DinG,cas4,casR,WYL,Cas9_archaeal,csf1gr8,csf4gr11,csf2gr7,csf3gr5,c2c9_V-U4,Cas14u_CAS-V	NA|138aa|up_6|NC_009338.1_4154917_4155331_+,NA|80aa|up_5|NC_009338.1_4155774_4156014_+,NA|74aa|up_4|NC_009338.1_4156010_4156232_+,NA|88aa|up_2|NC_009338.1_4158789_4159053_-,NA|194aa|up_1|NC_009338.1_4159491_4160073_-,NA|83aa|up_0|NC_009338.1_4160164_4160413_-,NA|98aa|down_2|NC_009338.1_4163644_4163938_-,NA|98aa|down_7|NC_009338.1_4167310_4167604_+,NA|140aa|down_8|NC_009338.1_4167860_4168280_+	NA|252aa|up_9|NC_009338.1_4151926_4152682_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|381aa|up_8|NC_009338.1_4152678_4153821_+	PRK10549, PRK10549, two-component system sensor histidine kinase BaeS	NA|228aa|up_7|NC_009338.1_4153829_4154513_-	pfam03713, DUF305, Domain of unknown function (DUF305)	NA|138aa|up_6|NC_009338.1_4154917_4155331_+	NA	NA|80aa|up_5|NC_009338.1_4155774_4156014_+	NA	NA|74aa|up_4|NC_009338.1_4156010_4156232_+	NA	NA|382aa|up_3|NC_009338.1_4157179_4158325_-	pfam05065, Phage_capsid, Phage capsid family	NA|88aa|up_2|NC_009338.1_4158789_4159053_-	NA	NA|194aa|up_1|NC_009338.1_4159491_4160073_-	NA	NA|83aa|up_0|NC_009338.1_4160164_4160413_-	NA	Cas9_archaeal|134aa|down_0|NC_009338.1_4162940_4163342_-	pfam14279, HNH_5, HNH endonuclease	NA|66aa|down_1|NC_009338.1_4163338_4163536_-	TIGR01764, Probable_excisionase, DNA binding domain, excisionase family	NA|98aa|down_2|NC_009338.1_4163644_4163938_-	NA	NA|380aa|down_3|NC_009338.1_4164150_4165290_-	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|115aa|down_4|NC_009338.1_4165651_4165996_-	pfam12487, DUF3703, Protein of unknown function (DUF3703)	NA|230aa|down_5|NC_009338.1_4166023_4166713_-	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	csa3|110aa|down_6|NC_009338.1_4166724_4167054_-	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|98aa|down_7|NC_009338.1_4167310_4167604_+	NA	NA|140aa|down_8|NC_009338.1_4167860_4168280_+	NA	NA|119aa|down_9|NC_009338.1_4168300_4168657_-	smart00342, HTH_ARAC, helix_turn_helix, arabinose operon control protein
GCF_000016365.1_ASM1636v1	NC_009338	Mycolicibacterium gilvum PYR-GCK, complete sequence	4	4292367-4292501	4	CRISPRCasFinder	no		csa3,DEDDh,cas3,DinG,cas4,casR,WYL,Cas9_archaeal	Orphan	CGGGGACCGCCACCGGGACGCGGACCACCG	30	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh,cas3,DinG,cas4,casR,WYL,Cas9_archaeal,csf1gr8,csf4gr11,csf2gr7,csf3gr5,c2c9_V-U4,Cas14u_CAS-V	NA,NA|175aa|down_3|NC_009338.1_4295385_4295910_+	NA|316aa|up_9|NC_009338.1_4281542_4282490_-	COG1409, Icc, Predicted phosphohydrolases [General function prediction only]	NA|201aa|up_8|NC_009338.1_4282643_4283246_+	pfam12079, DUF3558, Protein of unknown function (DUF3558)	NA|545aa|up_7|NC_009338.1_4283253_4284888_+	COG2936, COG2936, Predicted acyl esterases [General function prediction only]	NA|200aa|up_6|NC_009338.1_4284876_4285476_-	pfam08819, DUF1802, Domain of unknown function (DUF1802)	NA|89aa|up_5|NC_009338.1_4285472_4285739_-	pfam10041, DUF2277, Uncharacterized conserved protein (DUF2277)	NA|271aa|up_4|NC_009338.1_4285737_4286550_+	PRK06190, PRK06190, enoyl-CoA hydratase; Provisional	NA|204aa|up_3|NC_009338.1_4286691_4287303_+	pfam08241, Methyltransf_11, Methyltransferase domain	NA|440aa|up_2|NC_009338.1_4287486_4288806_-	cd13136, MATE_DinF_like, DinF and similar proteins, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins	NA|337aa|up_1|NC_009338.1_4288798_4289809_-	COG0618, COG0618, Exopolyphosphatase-related proteins [General function prediction only]	NA|169aa|up_0|NC_009338.1_4289783_4290290_-	PRK00521, rbfA, 30S ribosome-binding factor RbfA	NA|121aa|down_0|NC_009338.1_4293187_4293550_-	pfam04296, DUF448, Protein of unknown function (DUF448)	NA|354aa|down_1|NC_009338.1_4293617_4294679_-	PRK12327, nusA, transcription elongation factor NusA; Provisional	NA|179aa|down_2|NC_009338.1_4294681_4295218_-	PRK00092, PRK00092, ribosome maturation protein RimP; Reviewed	NA|175aa|down_3|NC_009338.1_4295385_4295910_+	NA	NA|162aa|down_4|NC_009338.1_4295906_4296392_+	pfam14530, DUF4439, Domain of unknown function (DUF4439)	NA|589aa|down_5|NC_009338.1_4296395_4298162_-	PRK09194, PRK09194, prolyl-tRNA synthetase; Provisional	NA|544aa|down_6|NC_009338.1_4298189_4299821_-	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|404aa|down_7|NC_009338.1_4299854_4301066_-	cd11642, SUMT, Uroporphyrin-III C-methyltransferase (also known as S-Adenosyl-L-methionine:uroporphyrinogen III methyltransferase, SUMT)	NA|455aa|down_8|NC_009338.1_4301062_4302427_-	PRK01077, PRK01077, cobyrinate a,c-diamide synthase	NA|205aa|down_9|NC_009338.1_4302524_4303139_-	PRK05986, PRK05986, cob(I)yrinic acid a,c-diamide adenosyltransferase
